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1. Introduction 



The word and conjugacy problems are the most classical algorithmic problem for groups 
going back to the work of Dehn and Tietze at the beginning of the 20th century. These 
very basic problems are interested by themselves and because of applications to other areas 
of mathematics, especially topology (where the analogs of these problems are problems of 
existence of pointed or free homotopy between loops). There are very many papers devoted 
to these problems for various classes of groups. Different aspects of these problems are 
discussed in many books and surveys (see, for example, [Ml ESI EQl [72l [78l [HI 11371 11471 
11621 11811 11871 12071 1208| ) . Still there are very many new results and ideas that appeared after 
the last of these surveys was published. We can mention, for example, the construction of 
finitely presented groups with polynomial-non-recursive and even quadratic-non-recursive 
Dehn functions |182j . finding a nilpotent finitely generated group with Dehn function not 
of the form n" for any a |235| . the proof that all groups SL„(Z), n > 5, the R.Thompson 
group extensions of finitely generated free groups by cyclic groups, have quadratic Dehn 
functions [2391 HHl [36] , the description of all FFFL (space) functions of finitely presented 
groups and the algebraic characterization of groups with PSPACE word problem |183j . 
solving the isomorphism problem for arbitrary hyperbolic groups [65] and many others. 
Several new methods are used in the proofs of these results and our goal in this survey is to 
give as gentle as possible an introduction to these results and methods. To this end, we often 
present not the results in their full generality but their easier to explain approximations. In 
this regard, this survey is similar to our survey [137] with Olga Kharlampovich. 

Another feature of this survey is the emphasis on "related problems". For example we 
consider hyperbolic groups (Section 13. ID . This is the class of groups with the smallest 
possible (linear) Dehn functions. The word problem in a hyperbolic group can be solved 
in linear time (in fact by a real time Turing machine |123j ). We mention that this class 
is very large: almost all finitely presented groups are hyperbolic. This leads us to the 
discussion of various probabilistic models used to clarify the words "almost all", to small 
cancelation conditions (including coarse small cancelation conditions over hyperbolic group) , 
to different ways of constructing hyperbolic groups, Gromov-Olshanskii theory of quotients 
of hyperbolic groups and various combination theorems, to various monsters constructed 
as limits of hyperbolic groups and to Gromov's random groups. We also discuss various 
weakening of the linear isoperimetric inequality including Wenger's result from [234] and 
Cartan-Hadamard local-to-global type theorems of Gromov j98j and others. Similarly, when 
we consider groups with quadratic Dehn functions (Section [J]), we describe the very diverse 
zoo of examples of these groups, present ideas of the proofs of results from [239], [lllj and 
others. Then we notice that all known groups with quadratic Dehn functions (except for 
automatic groups) are known to have solvable conjugacy problem although the solvability 
of conjugacy problem was proved by very different methods in different classes of groups. 
So we formulate and discuss a general problem (due to Rips). 
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Most of the paper is devoted to the Dehn functions which measure the time complexity 
of the word problem. One can argue that after the growth function (first introduced by 
A. S. Svarc in [226], and later by Milnor |164| ) which counts the number of different elements 
of the group that can be represented by products of generators of length < n, this is 
possibly the most important and basic geometric invariant of a groupQ We also discuss 
the space complexity and the recent results of Olshanskii |183| . We present an algebraic 
characterization of groups with word problem in NP from [22] and groups with word problem 
in PSPACE from |183| . discuss examples of groups with NP-complete |210j and coNP- 
complete [21] word problems. 

Note that we could not survey all aspects of complexity of the word and conjugacy 
problems in groups. For example we do not talk about the average case complexity, generic 
solutions of algorithmic problems, search problems and other complexity related issues used 
in group based cryptography |168j . Fortunately, there is a nice recent survey by Shpilrain 
[219j where at least some of these topics are discussed. 

Acknowledgement. I am grateful to Efim Zelmanov who inspired me to write this 
survey. Several people contributed with suggestions, comments and even pieces of text. I 
am especially grateful to Martin Bridson, Frangois Dahmani, Daniel Groves, Victor Guba, 
Sergei Ivanov, Bruce Kleiner, Igor Lysenok, Alexander Olshanskii, Denis Osin, Tim Riley, 
Stefan Wenger and Robert Young. 

2. Algorithmic problems in groups 

Let X be a set. A word over X is a sequence of elements of X. A group word over X 
is a sequence of elements of X and their inverses, i.e. symbols x~^, x G X. The length 
of a word W is denoted by A group word is called reduced if it does not contain 

subwords of the form xx~^ and x~^x, x £ X. Every group word can be made reduced by 
removing all subwords of these forms. The set F(X) of all reduced group words over X is 
equipped with the binary operation: the product UV of two reduced group words U, V is 
the result of reducing the concatenation of U and V. With this operation, the set F(X) 
turns into a group which is called the free group over X. The identity element of that group 
is the empty word denoted by 0. For every group G, any map X ^ G extends uniquely 
to a homomorphism F{X) — ?• G: any word is mapped to the product of the images of its 
letters. In particular, every group G generated by at most \X\ elements is a homomorphic 
image of F{X) so that the image of X generates G. Let (j) be one of these (surjective) 
homomorphisms. 

The word problem in G (related to (p) is the following 

Problem 2.1. (The word problem). 

Input: A reduced group word W over X. 

Output: "Yes" if 0(W^) = 1 in G and "No" otherwise. 

Let us also introduce a "close cousin" of the word problem, the conjugacy problem. We 
say that two elements a, 6 of a group G are conjugate if there exists an element t (called a 
conjugator) such that tat^^ = b. 

The conjugacy problem in G (related to (j)) is the following 

Problem 2.2. (The conjugacy problem). 

Input: Two reduced group words U, V over X. 

Output: "Yes" if 4>{U) and (f){V) are conjugate in G and "No" otherwise. 

^Dehn functions are possibly even more important than growth functions because the set of Dehn functions 
is much more diverse than the set of growth functions (see Section Q, so Dehn functions capture more 
information about a group. 
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In principle, the existence of an algorithm to solve Problem 12.11 or [2^2] depends not only 
on G but also on (f). There is, however, an important case when (p does not matter. Suppose 
that X is finite. Then G = (f){F{X)) is called finitely generated. Let N be the kernel of 
<j). It is a normal subgroup of F{X). Suppose that N is generated as a normal subgroup 
by a finite set R ("generated as a normal subgroup" means that every element of is a 
product of conjugates of elements of R and their inverses). Then we say that G has a finite 
presentation {X \ R). For finitely presented groups the solvability of the word or conjugacy 
problem (i.e. the existence of the needed algorithm) does not depend on the choice of or 
even on the choice of the finite set X (as long as (j) is surjective). 

Note that the conjugacy problem is stronger than the word problem, that is if the con- 
jugacy problem is solvable in G, then the word problem is also solvable. Indeed, only the 
identity element can be a conjugate of the identity element. 

2.1. Van Kampen diagrams. 

2.1. A. The definition. Let G be a finitely presented group given by a finite set of generators 
X and a finite set of relators R, cj)\s a homomorphism from F{X) onto G, N is the kernel of 
this homomorphism. Then words in N are precisely products of generators of G and their 
inverses that are equal to 1 in G. The word problem for G is then the membership problem 
in N for the elements of F{X). Note that if the product uv is in N then vu is also in N 
(as a conjugate of uv). The word vu is called a cyclic shift of uv. We shall always assume 
(without loss of generality) that R consists of cyclically reduced words, that is group words 
all of whose cyclic shifts are reduced. Since is generated by i? as a normal subgroup, a 
reduced group word W from F{X) is in A^ if and only if W can be represented in the free 
group as a product of conjugates of elements of R and their inverses: 

m 

(1) W = J{s,rfh-^ 

i=l 

where Si G F{X), ri € R. 

The minimal number m in all representations ([1]) of is called the area of W for the 
reasons explained below. For every representation ([1]), we can draw a planar diagram, a 
bouquet of "lollipops" which is a planar labeled graph. Each "lollipop" corresponds to one 
of the factors SjrjS^^, it has a stem, a path labeled by Si (i.e. the stem is subdivided into 
edges labeled by the letters of Sj), and a candy, a cycle path labeled by (see Figured]). 

Going counterclockwise around the "lollipop" starting and ending at the tip of the stem, 
we read SjrjS"^. Thus going counterclockwise around the diagram which is the bouquet of 
"lollipops", we read the word which is the right hand side of ([T]). 

In order to make the word W from this word, we need to reduce the boundary of the 
bouquet of "lollipops" (the boundary is traced counterclockwise): every time we see a pair 
of consecutive edges on the boundary of the diagram, which have the same label and the 
same initial or terminal vertex (see Figure [2]), we identify these two edges (if the edges 
have both vertices in common, we identify the two edges and remove the whole subgraph 
bounded by them on the plane). This amounts to removing a subwords xx~^ and x~^x 
from the right hand side of ([1]) . The resulting picture is a van Kampen diagram for W over 
the presentation {X \ R), that is a planar graph with edges labeled by elements of X, the 
boundary of each cell (i.e. the closure of a bounded connected components of the plane 
minus the graph) labeled by words from R^^, and the boundary of the whole graph (i.e. 
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Figure 1. Using lollipops to build a van Kampen diagram 



the boundary of the infinite component of the plane minus the graph) is labeled by W (see 
[147] for a slightly different definition and [175j for another one). 




Figure [3] shows how a typical van Kampen diagram may look like. One can see that the 
cells may be of different sizes and shapes, a cell can touch itself, etc. It is important also that 
a diagram itself may not be an embedded disc: several pieces as on Figure[3]can be connected 
by paths to form a tree of discs (as the bouquet of lollipops in Fig[T|). Nevertheless it is a 
planar graph, and there are various general methods to study van Kampen diagrams (the 
whole book |175| is essentially about the study of van Kampen diagrams used to construct 
groups with extreme properties such as infinite bounded torsion groups, Tarski monsters, 
etc.). 
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Figure 3. A van Kampen diagram without labels. 



Thus we have shown 

Lemma 2.3 (The first half of the van Kampen lemma). // a reduced group word W over the 
alphabet X is equal to 1 in G, then there exists a van Kampen diagram over the presentation 
of G with boundary label W. 

This is a half of the so called van Kampen lemma [1471 1175^ [35] . The converse statement 
(that the boundary label of every van Kampen diagram is equal to 1 in G) constitutes 
the second half. In order to prove it, we have to "undo" the construction of van Kampen 
diagram above, and, given a van Kampen diagram, produce an equality of the form ([1]). 
The proof is by cutting a van Kampen diagram along the edges, to produce a tree of 
"lollipops". It is obvious that this can be done somehow. In |187j . we presented a useful 
and economical way to cut a van Kampen diagram (so that every edge is used at most four 
times). Since cutting van Kampen diagrams into nice pieces is the main ingredient of many 
proofs involving van Kampen diagrams (see Section [4.61 below, for example), we reproduce 
the proof here. 

Note that if a word W can be represented in the free group in the form 

UiriU2 . . . UmrmUm+l 

where uiU2 ■ ■ ■ lim+i = 1 in the free group, then W is equal (in the free group) to 

(uiriu5"^)(uiU2r2U2 ^) ■ ■ ■ (Ul . . . UmTmUm ■ ■ ■ Ul^)^! ■ ■ ■ UmUm+1 
= {uiriU^^){uiU2r2U2^U^^) . . . (ui . . . UmrmU':^ ■ ■ ■ Ui^) 

and so W £ N. The converse statement is obvious. 

Proposition 2.4 (The second half of the van Kampen lemma). Let A be a van Kampen 
diagram over a presentation {X \ R) where X = X~^ , R is closed under cyclic shifts and 
inverses. Let W be the boundary label of A. Then W is equal in the free group to a word 
of the form uiriU2r2 ■ ■ ■ Umi~dUm+i where: 

(1) Each ri is a cyclic shift of a word from R^^; 

(2) uiU2 ■ ■ ■ lim+i = 1 in the free group; 

(3) Si^V 1^*1 — where e is the number of edges of A. 

In particular, W is in the normal subgroup N and is equal to 1 in G. 

Proof. If A has an internal edge (i.e. an edge which belongs to the boundaries of two cells) 
then it has an internal edge / one of whose vertices belongs to the boundary. Let us cut A 
along / leaving the second vertex of / untouched. We can repeat this operation until we get 
a diagram Ai which does not have internal edges. It is easy to see that the boundary label 
of Ai is equal to W in the free group. The number of edges of Ai which do not belong to 
contours of cells (let us call them edges of type 1) is the same as the number of such edges 
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in A and the number of edges which belong to contours of cells in Ai {edges of type 2) is 
at most twice the number of such edges of A (we cut each edge from a contour of a cell at 
most once, after the cut we get two external edges instead of one internal edge). 

Suppose that a cell 11 in Ai has more than one edge which has a common vertex with IT 
but does not belong to the contour of H. 




Figure 4. Cutting 11' inside H. 



Take any point O or0 c^(n) which belongs to one of the edges not on 9(11). Let p be the 
boundary path of Ai starting at O and let q be the boundary path of 11 starting at O. 
Consider the path qq~^p. The subpath q^'^p bounds a subdiagram of Ai containing all cells 
but n. Replace the path q in qq~^p by a loop q' with the same label starting at O and lying 
inside the cell 11. Let the region inside q' be a new cell H'. Then the path q'q~^p bounds a 
diagram whose boundary label is equal to W in the free group F{X). Notice that H' has 
exactly one edge having a common vertex with H' and not belonging to the contour of H' 
(see Figure H]). Thus this operation reduces the number of cells H such that more than one 
edge of the diagram has a common vertex with H but does not belong to the contour of H. 

After a number of such transformations we shall have a diagram A2 which has the form 
of a tree T with cells hanging like leaves (each has exactly one common vertex with the 
tree). 

The number of edges of type 1 in A2 cannot be bigger than the number of all edges in 
Ai, so it cannot be more than two times bigger than the total number of edges in A. 
The boundary label of A2 is equal to W in the free group, and it has the form 

where m is the number of cells in A, U1U2 ■ ■ ■ Um+i is the boundary label of a tree (traced 
counterclockwise), so U1U2 ■ ■ -Um+i = 1 in the free group. The sum of lengths of Uj is at 
most four times the number of edges in A because the word U1U2 ■ ■ ■ Um+i is written on the 
tree T, and when we travel along the tree, we pass through each edge twice. □ 



'We use d to denote boundaries of cells, diagrams, subgraphs, etc. 
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2.I.B. The O-cells. One can modify van Kampen diagrams to make all cells embedded discs 
(so that the boundary of a cell does not touch itself) and to make the whole diagram an 
embedded disc by introducing the so-called O-cells |175j . i.e. cells corresponding to relations 
1-1 = 1 and 1 • a • 1 • = 1 (see Figure [5|). 

a 

^/^^ 1 1 

1 a 
Figure 5. O-cells 

The edges labeled by 1 are called 0-edges. Indeed, adding these relations does not change 
the group. Now, for every cell in the van Kampen diagram (the cell is drawn with thick 
lines on Figure [6]), we draw an embedded disc inside the cell, label its boundary by the 
same word which labels the boundary of the original cell, and connect every vertex of the 
boundary of the new cell with the corresponding vertex of the boundary of the original 
cell by a 0-edge. Thus we replace every cell of the original diagram by a new cell with the 
same boundary label and several O-cells. The new van Kampen diagram will have all cells 
embedded discs. Similarly, if a diagram is not an embedded disc, say, it consists of two disc 
subdiagrams connected by a path which is a part of the boundary of the diagram, we can 
replace that path by a sequence of O-cells. As a result, we can make the whole diagram an 
embedded disc. When we count the number of cells (edges) in a diagram we usually do not 
count O-cells (0-edges). In particular 0-edges do not affect the calculation of the length of 
a path in a van Kampen diagram. 




Figure 6. Inscribing an embedded cell inside a non-embedded cell of a van 
Kampen diagram. 

The O-cells were introduced by Olshanskii |175j in order to make his proofs cleaner, 
but they proved to be a very useful technical tool. Let us mention just two applications. 
First, O-cells were used in solving quadratic equations in free groups in jl74| (see 12. 3. Bp 
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and subsequent papers (there are no non-O-relations in the standard presentation of the 
free group). Second, a natural generahzation of 0-cells was a crucial tool in constructing a 
finitely presented non-amenable group without free subgroups |188j (see I5.4p . 

2. I.e. Van Kampen diagrams and tilings. An elementary school problem and its non- 
elementary solution. As an easy application of van Kampen diagrams, consider the following 
elementary problem. 

Example 2.5. Let P he the standard 8x8 chess hoard with two opposite squares removed 
(Figure^. Prove that P cannot he tiled by the standard 2x1 dominos. 



a a 



a a 



Figure 7. The chess board with two squares removed and two dominos. 



Proof. The elementary solution of this problem is well known. Color squares of P in black 
and white in the usual (chessboard) way. Then the number of squares of one color (black or 
white) in P differs from the number of squares of the other color by 2. Since each domino 
covers exactly one white and exactly one black square, P cannot be tiled by dominos. 

Here is a solution which, although less elementary, can be applied to many regions of the 
plane square grid for which the elementary proof above does not work; it also applies to 
regions of non-square (say, hexagonal) lattices on the plane. This solution first appeared in 
a paper by W. Thurston [227] . The ideas of that paper have many applications in several 
areas of mathematics from combinatorics to probability to mathematical physics. 

Let us label horizontal edges pointed rightward in P by the letter a and vertical edges 
pointed upward by the latter b. Then the (counterclockwise) boundary of P has label W = 
aJb'^ a~^ha~'^b~'^ ah. Every domino can be placed either vertically or horizontally. In the first 
case its boundary label is ab^a~^b~^ , and in the second case its boundary label is a^ba'~^b~^ . 
Now consider the group G with the presentation {a,b \ ab^a~^b~^ = l,a^6a~^6~^ = 1). 
Suppose that P can be tiled by the dominos. Then every such tiling turns P into a van 
Kampen diagram over the presentation of G. Hence, by the van Kampen lemma (more 
precisely by Proposition 12. 4p . the word W would be equal to 1 in G. But consider the 
6-element symmetric group 5*3 and two permutations a = (l,2),/3 = (2,3) in it. Clearly 
both relations of G hold if we replace a by a and b by /?. Hence the map a i— )■ a, 6 i— )■ /3 
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extends to a homomorphism G — )■ S^. Note that W{a,l3) = = a/3 is the permutation 

(1,3,2) which is not trivial. Hence W is not equal to 1 in G, a contradiction. □ 

This example shows a similarity between the word problem in groups and the tiling 
problem. Indeed, by the van Kampen lemma, a word W is equal to 1 in G = {X \ R) if 
and only if we can tile a disc with boundary labeled by W by pieces whose boundaries are 
labeled by words from R. Nevertheless the word problem differs from the tiling problem 
as in Example 12.51 because, as we have seen, when we draw a van Kampen diagram, we do 
not fix the shapes or sizes of the tiles, only the labels of the boundaries of them, so one can 
view a general van Kampen diagram as a tiling of a disc by tiles made of soft rubber, while 
the traditional tiling problems such as Example 12.51 are about tiles made of hard plastic. 

2.2. The computational complexity and algebraic systems. 

2. 2. A. Solvability of algorithmic problems: theory and practice. The solvability of an algo- 
rithmic problem does not mean that the problem can be solved in practice. First of all the 
existence of an algorithm does not mean that it is readily available. For example, the word 
problem in every finite group is certainly decidable. But there is no algorithm that, given 
a finite set of relations = 1, i = l,...,n, and a relation r = 1 over a finite alphabet X, 
would decide whether every finite group generated by X where the relations rj = 1 hold 
{i = 1, . . . ,n) also satisfies r = 1. That is a result of Slobodskoj |221j . Thus there is no 
uniform algorithm solving the word problem in all finite groups at once. In general we 
usually ask for existence of an algorithm solving certain algorithmic problem but not for a 
procedure to find this algorithm. (In most concrete cases, if an algorithmic problem in a 
given group is decidable, we are able to write down an algorithm that solves the problem.) 

Another, more important, obstacle is that the algorithm can be too slow. For example, 
if a finitely presented group G is residually finite 0, then the word problem can be solved 
by the so called McKinsey algorithm |156j : list all finite quotients of G (that is consider all 
finite groups H one by one, consider every map (p from the set of generators of G into H, 
and check if this map takes all rj to 1, and whether the images of generators of G generate 
H] if - yes, then include the pair {H, cp) in the list of quotients), and at the same time list 
all the corollaries of the defining relations of G (say, list all products of conjugates of rj's 
and their inverses) . Every word r is either in the list of corollaries or is not trivial in one of 
the finite quotients. In the first case r = 1 in G, in the second case, r 7^ 1 in G, and after 
some time we will surely know which of this options holds. It is clear that in general this 
algorithm is very slow. Even for a short word r it would take a lot of time to decide, using 
this algorithm, if r = 1 in a given residually finite group. 

2.2.B. Computational complexity. Thus the next thing to do, after we find out that an 
algorithmic problem is decidable, is to find the computational complexity of this problem. 

To make this more precise let us present here some concepts from the Computational 
Complexity Theory. 

Any decision problem D may be considered as a membership problem for elements of some 
set B/) in a subset So- For example if D is the word problem for a group G = {X \ R), then 
Bd is the set of all reduced group words in the alphabet X, and Sd is the set of products 
of conjugates of elements of R and their inverses in the free group over X (i.e. the normal 
subgroup of the free group over X generated by R) . 

With any element x in Bij one associates a number which is called the size of this element. 
Usually the size is roughly the minimal space which is needed to write x down. For example 
the size of a word is typically its length. 



'A group is residually finite if the intersection of all its subgroups of finite index is trivial. 
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The size depends on the way we choose to represent the elements. For example if x is 
a natural number then we can represent x as x units. Then the size of x will be equal to 
X. If we represent x as a sequence of binary digits then the size of x will be approximately 
log2(x). 

Algorithms may be realized by Turing machines (for a formal definition see below). We 
can assume that this machine is equipped with a voice synthesizer and can say two words 
"Yes" and "No". An algorithm solving the decision problem D starts working with an 
element x of written on the tape of the machine. When it ends, it says "Yes" if x G So 
or "No" if 2; Sd- There are also different kinds of algorithms that say "Yes" when x £ So 
and work indefinitely long without saying anything if x ^ Sd- Note that if there is an 
algorithm that recognizes the "yes" part and there is an algorithm recognizing the "no" 
part of D, then there exists an algorithm solving D. 

2.2.C. Turing machines (a formal definition). Recall one of the many equivalent definitions 
of a Turing machine. 

A multi-tape Turing machine has k tapes and k heads observing the tapes. One can view 
it as a quadruple 

M = {I,Y,Q,e) 

where / is the input alphabet, Y is the tape alphabet (/ C y), Q = \jQi,i = 1,. . . ,k is 
the set of stated of the heads of the machine, G is a set of commands. Some (recognizing) 
Turing machines also have two distinguished vectors of state letters: si is the A:-vector of 
start states, sq is the /c-vector of accept states. 

The leftmost square on every tape is always marked by a, the rightmost square is always 
marked by cj. The head is placed between two consecutive squares on the tape. A config- 
uration of a tape of a Turing machine is a word auqvuj where q is the current state of the 
head of that tape, u is the word in Y to the left of the head and v is the word in Y to the 
right of the head, and so the word written on the entire tape is uv. 

At every moment the head of each tape observes two letters on that tape: the last letter 
of u (or a) and the first letter of v (or uj). 

A configuration [/ of a Turing machine is the word 

U1U2 ...Uk 

where Ui is a configuration of tape i. 

Assuming that the Turing machine is recognizing, we can define input configurations and 
accepted (stop) configurations. An input configuration is a configuration where the word 
written on the first tape is in 7"'", all other tapes are empty, the head on the first tape 
observes the right marker w, and the states of all tapes form the start vector si. An accept 
(or stop) configuration is any configuration where the state vector is sq, the accept vector 
of the machine. We shall always assume (as can be easily achieved) that in the accept 
configuration every tape is empty. 

A command of a Turing machine is determined by the states of the heads and some of 
the 2k letters observed by the heads. As a result of a transition we replace some of these 
2k letters by other letters, insert new squares in some of the tapes and may move the heads 
one square to the left (right) with respect to the corresponding tapes. 

For example in a one-tape machine every transition is of the following form: 

uqv — )• u'g't)' 



U denotes disjoint union. 
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where u,v,u',v' are letters (could be end markers) or empty words. The only constraint is 
that the result of applying the substitution uqv — )• u'q'v' to a configuration word must be a 
configuration word again, in particular the end markers cannot be deleted or inserted. 

This command means that if the state of the head is q, u is written to the left of q and 
V is written to the right of q then the machine must replace uhy u' , qhy q' and v hy v' . 

For a general /c-tape machine a command is a vector 

[Ui^Vu...,Uk^Vk] 

where C/j — t- is a command of a 1-tape machine, the elementary commands (also called 
parts of the command) Ui ^ Vi are listed in the order of tape numbers. In order to execute 
this command, the machine first checks if every Ui is a subword of the configuration of tape 
i (i = 1, . . . , fc), and then replaces all Ui by Vj. 

A computation is a sequence of configurations Ci — t- • • • — t- C„, such that for every 
i = 1, . . . ,n — 1 the machine passes from Ci to Cj+i by applying one of the commands from 
Q. A configuration C is said to be accepted by a machine M if there exists at least one 
computation which starts with C and ends with an accept configuration. 

A word u € X* is said to be accepted by the machine if the corresponding input configu- 
ration is accepted. The set of all accepted words over the alphabet X is called the language 
accepted (recognized) by the machine 

Let C = Ci — 7- • • • — 7- C„ be a computation of a machine M such that for every j = 
1, . . . ,g — 1 the configuration Cj+i is obtained from Cj by a command Oj from Q. Then 
we call the word 6i . . . 9n-i the history of this computation. The number n will be called 
the time (or length) of the computation. Let pj (i = 1, . . . , (7) be the sum of the lengths of 
the configurations of the tapes in the configuration Ci . Then the maximum of all pi will be 
called the space of the computation. 

We do not only consider deterministic Turing machines, for example, we allow several 
transitions with the same left side. 

Note that Turing machines can also be viewed as rewriting systems where objects are the 
configurations and substitution rules are the commands of the machine. This point of view 
allows one to use the tools of the string rewriting theory, normal forms, confluence, etc. 
One can also view Turing machines as semigroups of partial transformations of the set of 
configurations generated by the transformations induced by the commands. This point of 
view allows to explore dynamics properties of Turing machines (say, periodic computations). 
That is often needed when one tries to simulate a Turing machine by a group |188| . 

2.2.D. The time and space functions. With every Turing machine A one can associate two 
important functions: the time function and the space function. 

The time {space ) function tA{n) (resp. Syi(n)) of a Turing machine A is the minimal 
number of steps of the machine (resp. number of cells on the tape visited by the machine) 
to decide that any element x € Sd of size < n is indeed in Sd- 

We can compare the time (space) functions of algorithms as follows. If /, 5: N — t- N, we 
write f < g \i for some constant C we have f{n) < Cg{Cn) + Cn + C for every n £ N. 
We say that / and g are equivalent., li f < g and g < f . Thus every sublinear function is 
equivalent to n, functions and 2n^, 2^ and 3^ are equivalent, but functions n^'^ and n^'^ 
are not. 

The following relation is clear: 

SA{n) ■< tA{n). 

Indeed, even if at every step the algorithm used a new tape cell, sa(?t.) would be only 
equivalent to tA{n) (since the number of cells used by the machine at every step is bounded). 
In reality SA{n) is almost always less than tA{n). On the other hand the following relation 
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tA{n) -< 2^-^('^) 

2.2.E. A very rarely accepting Turing machine. Turing machines, like groups, can have 
some extreme properties. It is well known, for example, that a Turing machine can have 
undecidable halting problem, that is there is no algorithm that checks whether the machine 
will ever stop starting with a given input configuration. This is obviously equivalent to 
the property that the time and space functions of this Turing machine are bigger than any 
given recursive function (i.e. a function N — )■ N whose graph and its complement can be 
recognized by a Turing machine). In Section [4.61 we shall need a Turing machine with even 
stronger property. 

Let X be a recursively enumerable language in a 2-letter alphabet, i.e. a set of binary 
words recognized by a (non-deterministic) Turing machine M. If x € X then the time 
of X (denoted time(x) or time a/ (x)) is, by definition, the minimal time of an accepting 
computation of M with input x. For any increasing function /i: N ^ N, a real number m 
is called h-good for M if for every w £ X, \w\ < m implies h{time{w)) < m. 

Let h{n) = expexp(n). 

Theorem 2.6 (Olshanskii, Sapir |182j ). There exists a Turing machine M recognizing a 
recursively enumerable non-recursive set X such that the set of all h-good numbers for M 
is infinite. 

It is easy to observe that the set G of /i-good numbers of M cannot be recursively 
enumerable. Moreover it cannot contain any infinite recursively enumerable subset. Indeed, 
if G contained an infinite recursively enumerable subset R, then for every input configuration 
C of M we would start the Turing machine enumerating R, and wait till we get a number 
r from R that is bigger than the size of G. Then to check if M halts on G, we would 
check all (finitely many) computations of M starting with G and having length less than 
log log r. The Turing machine M halts starting with G if one of these computations ends 
with the accept configuration. Hence the set X would be recursive. On the other hand, 
the complement of G is recursively enumerable because in order to verify that r is not an 
/i-good number, one needs only to start M and wait till it accepts an input configuration 
G of size < r but time(C) > log log r. Thus G is an immune set and its complement is 
simple in terminology of |154j . Note that the very existence of simple and immune sets was 
a non-trivial problem solved by Muchnik and Friedberg using the famous priority argument, 
answering a question of Post (see [154j and |222j for details) . Our proof of Theorem 12.61 also 
uses the priority argument and Matiyasevich's solution of the Hilbert 10th problem |158j . 

2.2.F. The complexity classes. If there exists an algorithm A which solves the "yes" part 
of problem D and tAin) is bounded from above by a polynomial in n then we say that D 
can be solved in polynomial time. The solvability in exponential time, polynomial space, 
exponential space, etc. is defined similarly. 

It is worth mentioning that if we modernize the Turing machine by, say, adding more 
tapes or heads, we won't change the complexity of the problem much. For example a non- 
polynomial time (space) problem cannot become polynomial as a result of that. The class 
of all problems which can be solved in polynomial time is denoted by P. 

If in the definition of the time complexity, we replace the deterministic Turing machines 
by non-deterministic Turing machines, then we obtain definitions of the solvability in non- 
deterministic polynomial time, non-deterministic exponential time, etc. Recall that a non- 
deterministic Turing machine is more intelligent than a deterministic one: it does not blindly 
obey the commands of the program, but, at every step, guesses itself what the next step 
should be. Roughly speaking a problem D can be solved in non-deterministic polynomial 



also holds 
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time if for every element x ^ Bo there exists a proof (usually called witness) that x belongs 
to Sd and the length (size) of this proof is bounded by a polynomial of the size of x. The 
class of all problems which can be solved in polynomial time by a non-deterministic Turing 
machine is denoted by NP. It is not known if P=NP. This is one of the central problems in 
Theoretical Computer Science. 

In order to prove that a problem is solvable in polynomial time it is enough to find a 
polynomial time algorithm solving this problem. 

We have mentioned that the complexity of the McKinsey algorithm presented in 12.21 is 
very high, but we do not know any finitely presented residually finite group with really hard 
word problem. Thus we formulate 

Problem 2.7. Is there (time or space) complexity bound for the word problem in a finitely 
presented residually finite group. More precisely, what is the highest class in the time 
complexity hierarchy [83] where the word problem in every finitely presented residually 
finite group belongs? An even more bold question: is the word problem of every finitely 
presented residually finite group in NP? 

2.2.G. Reducing one problem to another. In order to prove that a problem D cannot be 
solved in polynomial (exponential, etc.) time one has to take another problem Q which is 
known to be "hard" and reduce it to D. 

There are several kinds of reductions used in the Computer Science literature. One of 
them, the polynomial reduction in the sense of Karp [83], is the following. A reduction of 
a problem Q to a problem D \s a, function (p from Bq to Bu such that 

• An element x from Bq belongs to Sq if and only if (j){x) belongs to Sd. 

• The element (j){x) can be computed in polynomial time, in particular the size of 
4>{x) is bounded by a polynomial of the size of x. 

It is clear that if Q is "hard" and Q can be reduced to D then D is "hard" as well. 

Notice that, when we prove the undecidability of a problem D, we usually also reduce a 
problem Q known to be "hard" (which in this case means undecidable) , to D. In order to 
reduce Q to D we find a similar mapping (p, but we do not care about the size of 0(x). The 
"basic hard" problem is usually the halting problem of a Turing machine (or some other 
kind of machine such as Minsky machines). But it could be a different kind of undecidable 
problem such as the Hilbert's 10th problem (see [1371 Section 6] for details) or a membership 
in the range of a general recursive function as in McKenzie-Thompson [155] . 

2.2.II. An NP-complete problem. Many known problems A are such that A is in NP and 
every other NP-problem polynomially reduces to A. Such problems are called NP-complete. 
One of these problems is the following exact bin packing problem |83l Page 226], |1351 Page 



Problem 2.8. (Exact bin packing problem) 

• Input: A /c-tuple of positive integers (ri, . . . , r^) and positive integers B, N. 

• Output: "Yes" if there exists a partition of {1, . . . ,k} into N subsets {1, . . . ,k} = 
5i U . . . U Sn such that for each i = 1, . . . , N we have 



2.2.1. Complexity classes and algebraic systems. A deep connection between Computer Sci- 
ence and algebra was found by R.Fagin [79]. He proved, in particular, the following amazing 
model-theoretic characterization of classes of finite algebras whose membership problem can 
be solved in non-deterministic polynomial time. Recall that an algebraic system is by def- 
inition a set with certain operations and predicates of various arities [154] . For example. 
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a linearly ordered group is an algebraic system with one binary operation (product), one 
unary operation (taking inverses), one nullary operation (the identity element) and one 
binary predicate giving the order relation. 

Theorem 2.9 (Fagin [79]). The membership problem for an abstract (i.e. closed under 
isomorphisms) class of finite algebraic systems is in NP if and only if it is the class of all 
finite models of a second-order formula of the following type: 

3Qi3Q2...3Q„(G) 

where Qi is a predicate, and Q is a first- order formula. 

Basically this theorem means that the membership problem of a class of finite algebraic 
systems is in NP if and only if we can describe the structure of algebraic systems of this class 
in terms of functions and relations. Since all known methods of studying the structure of 
algebraic systems are based on studying functions (homomorphisms, polynomial functions, 
etc.) and relations (congruences, orders, etc.) we can conclude that for the membership 
problem of a class of finite algebras being in NP is equivalent to admitting a reasonable 
structure description. 

Note that a word in an alphabet with k letters can be considered as an algebraic system 
as well: a word is a linearly ordered finite set (say, {1, 2, . . . , n}) with k unary predicates Lx 
indexed by the letters of the alphabet such that Lx{p) holds if the p-th letter in the word is 
X. The set of all words in the A;-letter alphabet can be defined by obvious axioms involving 
these k -\- 1 predicates (the order predicate should satisfy the axioms of linear order, plus 
for every p = 1, 2, . . . , n one and only one of the statements Lx{p) holds). Reduced group 
words can also be defined using similar axioms, etc. Hence being able to solve the word 
problem in a group in non-deterministic polynomial time means, by Fagin's theorem, that 
we can find algebraic description of words that are equal to 1 in the group. 

Note that classes where the membership problem has other types of computational com- 
plexity have similar model theoretic characterizations. Classes with membership problem in 
P have been characterized by Immerman [127j . Sazonov |214j . |213j and Vardi |230j . Classes 
with exponential time membership problem were characterized by Fagin [79]. Classes with 
non-deterministic exponential time membership problem have been characterized by Fagin 
[79] and Jones and Selman [132] . For a detailed survey of these results see [80] . 

2. 2. J. Van Kampen diagrams as witnesses. Now let again G = {X | i?) be a finitely pre- 
sented group. Suppose that a word ^i; in X is equal to 1 in G. Then by Lemma l2.3b f van 
Kampen, w labels the boundary of a van Kampen diagram A over the presentation of G. 
Let us consider that van Kampen diagram as a witness for the equality w = 1. Since a van 
Kampen diagram cannot be drawn on a tape of a Turing machine, we need to replace it by 
an equivalent object which is a word. Such an object is given by Proposition 12.41 Instead 
of A, we should consider the (non-reduced) product 

(2) ui-ri-U2-r2- ...-Um-rd- Um+l 

from that proposition. By Proposition 12.41 the length of that word is — up to a constant 
multiple — the number of cells in A plus \w\. Since we identify functions that differ by a 
constant factor or linear summand, we can say that the size of A is the number of cells, i.e. 
the area of A. Clearly given a word of the form ([2]), i.e. a word subdivided into subwords 
Ui, rj, it takes linear time (in the length of the word) to check if it indeed satisfies conditions 
of Proposition 12.41 Thus we obtain the following 

Proposition 2.10 (See the proof of Theorem 1.1 |210] ). Suppose that for every word w 
that is equal to 1 in G there exists a van Kampen diagram over the presentation of G with 
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at most f{\w\) cells and boundary label w. Then the "yes part" of the word problem of G 
can be solved in non- deterministic time at most f{n). In particular if f is bounded by a 
polynomial, then the "yes" part of the word problem of G is in NP. 

2.3. Diagrams on surfaces, quadratic equations and Poincare conjecture. 

2. 3. A. The definition. Let S = Sg^n be a (possibly non-oriented) surface of genus g with n 
boundary components. Let again V = {X | i?) be a finite group presentation. A diagram 
on 5 is a tessellation of 5 by a labeled graph with all edges labeled by letters of X and all 
cells (regions) being discs with boundaries labeled by words from R^^. In fact, a diagram 
on a surface can be obtained from a usual (disc) van Kampen diagram as follows. Let A 
be a disc diagram whose boundary is a polygon where some pairs of sides are labeled by 
the same or mutually inverse words. If we identify these pairs of sides, we get a diagram on 
a surface (with possibly non-empty boundary). Conversely, given a diagram on a surface 
S, we can cut the surface by simple closed curves and arcs following edges of the diagram 
so that we obtain a polygon with some pairs of sides (corresponding to the cutting curves) 
labeled by the same or mutually inverse words). The result is an ordinary van Kampen 
diagram. For example, if we want to find out whether an element represented by a word u 
is a product of g commutator^ [2^1, yi] • • • [xg^Vg] in a group G = {X \ R),we need to check 
whether there exists a van Kampen diagram on an oriented surface of genus g with one 
boundary component labeled by u. In general, one can consider van Kampen diagrams on 
surfaces with boundaries when one tries to solve any quadratic equation w{xi, . . . , Xk) = 1 in 
G. Here tt; is a word in the alphabet XU{xi, . . . ,Xk} (letters from X are called coefficients, 
Xi are called unknowns), and for every i, the word w has exactly two occurrences of the 
form xf^ (hence the term quadratic). If the equation w = 1 does not have any coefficients, 
say, w = [xi,yi][x2,y2] ■ ■ ■ [xn,yn], then solutions of = 1 in a group G are in one-to-one 
correspondence with homomorphisms from the surface group 7ri(S') to G. 

2.3.B. The complexity of solving quadratic equations. Using van Kampen diagrams on sur- 
faces Olshanskii proved: 

Theorem 2.11 (Olshanskii |174j . see also Grigorchuk and Kurchanov [94], Griorchuk and 
Lysenok [95], and Lysenok |149j ). For every n, the solvability problem for quadratic equa- 
tions in n variables in any free group is in P (the size of the input of the problem is the 
sum of lengths of the coefficients). 

Note that the solvability of that problem was first proved by Cromerford and Edmunds 
in |56j . 

Recently Kharlampovich, Lysenok, Myasnikov and Touikan |135| proved that the uniform 
solvability problem (when the number of variables is not fixed) is harder. They also use 
diagrams on surfaces and results from [174j . plus some clever arguments reducing NP- 
complete exact bin packing problem (Problem 12. 8p to the problem of filling (tesselating) a 
surface with a van Kampen diagram over the presentation consisting of 0-cells. 

Theorem 2.12 (Kharlampovich, Lysenok, Myasnikov, Touikan |135| ). The (uniform prob- 
lem of) solvability of quadratic equations in the free group in NP-complete (the input of 
that problem is an equation, the output is "yes", if the equation has a solution in a free 
group). 

Note that the fact that solvability of quadratic equations over free monoids is NP-hard 
was proved before [135j by Diekert and Robson in [69]. It is still not known whether that 
(seemingly easier) problem is NP-complete. As Volker Diekert told us, the NP-completeness 

^[x,y] denotes xyx~^y~^ . 
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in the case of monoids would imply Theorem 12.121 and so the case of monoids is very 
interesting. 

2.3.C. Quadratic equations and the Poincare conjecture. Note that quadratic equations 
even without coefficients (i.e. homomorphisms from the surface group) are closely related 
to the Poincare conjecture. There are several reformulations of the Poincare conjecture in 
terms of homomorphisms of surface groups onto the direct product of two free groups (see 
Jaco jl31j . Stallings [225\ . Olshanskii |174] ). Here is the reformulation from Hempel |119j . 
Since the Poincare conjecture has now been proved by Perelman [197^ I198j . we formulate 
that reformulation as a theorem. 

Theorem 2.13 (Hampel |119j . Perelman [197^ 1198] ). Let S be a closed oriented surface, 
TTi{S) be its fundamental group. Suppose that (pi and (f)2 are two homomorphisms from 
TTi{S) onto the direct product of two free groups Fn x Fn,. Then there is an automorphism 
a of 'Ki{S) and an automorphism /3 of Fn x Fn such that a(j)i(3 = 4>2. 

This theorem means that there exists essentially at most one "maximal" solution of the 
equation [xi, yi] . . . [xg,yg] = 1 in F„ x F„, for every n. 

2.3.D. A link between Poincare conjecture and NP. The proof of Poincare conjecture implies 
that certain algorithmic problems are in NP. For example, consider the following problem 

Problem 2.14. (Triviality of the fundamental group of a 3-manifold). 

Input. A compact 3-manifold M without boundary represented as a union of tetrahedra. 
Output. "Yes" if the fundamental group of M is trivial and "no" otherwise. 

Note that the triviality problem is a part of the uniform word problem (see I2.2.AP : given 
a presentation of a group we need to decide if each generator is equal to 1 in the group. 
Note also that giving a triangulation of M, we can easily find the presentation of tti(M) 
which is of a comparable size as M. 

That Problem 12.141 is in NP follows from the fact that simply connected compact 3- 
manifolds are homeomorphic to the 3-sphere (the Poincare conjecture) and the result of 
S.V. Ivanov |129j that the problem of recognizing a 3-sphere is in NP. 

2.3.E. Other equations in free groups. Arbitrary equations in free groups correspond to 
diagrams on more complicated 2-dimensional complexes than surfaces and the solvability 
problem for them is much harder. A very non-trivial result of Makanin shows that this 
problem is decidable. 

Theorem 2.15 (Makanin |151j ). There exists an algorithm to check whether a system of 
equations over a free group has a solution. 

Note that for some relatively easy groups the problem of solvability of systems of equations 
is undecidable (this is so for, say, a finitely generated nilpotent group, see I3.2.A1) . 

Makanin's algorithm is very complicated and its complexity is known to be a tower of 
many exponents. Later, Makanin's algorithm was simplified and clarified by Razborov 
[201^ I202j . Razborov also gave a description of all solutions. Still the exact complexity 
of solving equations in free groups was an open problem till the following relatively recent 
result. 

Theorem 2.16 (Diekert, Gutierrez, Hagenah [TO]). The problem of solvability of an equa- 
tion in a free group is in PSPA CE, that is it can be solved by a deterministic Turing machine 
using polynomial amount of space (in terms of the size of the equation). 
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Other results from ^70j show that PSPACE most probably cannot be improved because a 
very similar problem of deciding the existential theory of a free group (i.e. solving systems 
of equations and inequations of the form w ^ 1) turned out to be PSPACE-complete (the 
solvability of systems of equations and inequations in a free group was proved to be decidable 
by Makanin in |152j ). See also I3.1.G] below. 

2.3. F. The conjugacy problem and annular diagrams. Note that the conjugacy problem 
is about solvability of the (quadratic) equations of the form xux^^ = v where u, v are 
coefficients and x is an unknown. Thus bv l2.3l we need to consider diagrams on an annulus. 
So let the surface S be an annulus (i.e. the surface 80^2), V he a presentation of some group 
G, and u, v be two group words over the generating set of this presentation. Then u and 
V are conjugate in G if and only if there exists a diagram A on S over a presentation V 
with boundary labels u,v (that was first noticed by Schupp and appeared in |163] . hence 
annular diagrams are sometimes called Schupp diagrams). 

2.4. Isoperimetric and isodiametric functions of groups. 

2. 4. A. The isoperimetric functions. Thus with every finite presentation V of group G we 
associate the following important function: /(n) is the smallest number such that every word 
w that is equal to 1 in G, l^yj < n, labels the boundary of a van Kampen diagram over the 
presentation of G with at most f{n) cells. This function is called the Dehn function of V. It 
is easy to see that the Dehn functions of different finite presentations of the same group are 
equivalent, hence we can talk about the Dehn function of a finitely presented group. Every 
function 51 : N — )• N with f ~< g is called an isoperimetric function of G. The Dehn functions 
of groups were introduced (under different names) by Computer Scientists Madlener and 
Otto |150j studying complexity of the word problem in groups, and by Gromov [98] as a 
geometric invariant of groups (see also Gersten |86] where the name "Dehn function" was 
introduced) . 

As a direct consequence of Proposition 12. 10^ we get 

Theorem 2.17. (See [210^ Theorem 1.1]). Every Dehn function of a finitely presented group 
G is (equivalent to) the time function of a Turing machine solving (non-deterministically) 
the "yes" part of the word problem in G. 

Hence we have 

Theorem 2.18 (Madlener, Otto [150] . Gersten [86]). The word problem in a finitely pre- 
sented group is solvable if and only if the Dehn function ( or one of the isoperimetric func- 
tions) is recursive. 

Being a geometric invariant means, in particular, that if two finitely presented groups 
are quasi-isometric, then their Dehn functions are equivalent. Recall that if Mi, M2 are 
metric spaces with distance functions di,d2, then a partial function (j) : Mi — ?• M2 is called 
quasi-isometry between Mi, M2 if there exist three constants A > 1, B > 0,G > such 
that 

(3) J d2{(t>{x)A{y)) -B<di{x,y)<A d2{(t>{x). (t>{y)) + B 

for all x,y (z Mi and every point in Mi is at distance at most C from the domain of (j), 
while every point in M2 is at distance at most C from the range of (p. In this case we call (p 
(^, i?, C)-quasi-isometry. If C = 0, we call (p {A, B)- quasi-isometry, and if i? = C = 0, 
then (p is an A- bi-Lipschitz map. Finally if i? = C = and we only have the second of the 
two inequalities in then (p is an A-Lipschitz map. 
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We say that (p: Mi — )• M2 is a quasi-isometry (bi-Lipschitz map or Lipschitz map) from 
Ml to M2 if this map is a quasi-isometry (bi-Lipschitz map or Lipschitz map) between Mi 
and 0(Mi). 

The spaces Mi and M2 are cahed quasi-isometric if there exists a quasi-isometry between 
Ml and M2. For example, any two Cayley graphs (see 12.81 for a definition) of a group 
with respect to two finite generating sets are quasi-isometric as metric spaces (the distance 
between two vertices is the length of a shortest path connecting them). Moreover any 
isomorphism between any subgroup of finite index in one group and a subgroup of finite 
index of another group induces a quasi-isometry of the Cayley graphs of these groups (that 
was a starting point of a proof of the celebrated Mostow rigidity theorem). Two finitely 
generated groups are called quasi-isometric if their Cayley graphs (with respect to finite 
generating sets) are quasi-isometric. It is not difficult to observe that two quasi-isometric 
groups have equivalent Dehn functions. In particular, if one of the two quasi-isometric 
groups has solvable word problem, the other group must also have solvable word problem 
(by Theorem EH]). 

2.4.B. The conjugacy problem and quasi-isometry. Note that the analogous statement is 
not true for the conjugacy problem: there exist two pairs of groups Ai > Bi and -B2 > ^2) 
each extension is of index 2, such that Ai,A2 have solvable conjugacy problem which Bi, i?2 
have unsolvable conjugacy problem (see Gorjaga-Kirkinskii [93j and Collins-Miller |55j). 

2.4. C. The isodiametric function. Another important function associated with van Kampen 
diagrams is the isodiametric function: for every word w that is equal to 1 in the group 
G = {X I R), let di(w;) be the minimal diameter of a van Kampen diagram with boundary 
label w. Then di(n) is the maximal value of di(w;) for all w with \w\ < n, w =g 1- 
The area of a van Kampen diagram with boundary label w gives the number of factors 
in a representation of id as a product of conjugates of elements of R and their inverses 
(by Proposition 12.10"]) . The diameter is roughly the maximal size of a conjugator used in 
this representation. It is easy to see that the isodiametric function does not depend (up to 
equivalence) on the finite presentation of G. It is known (D. Cohen [52], Gersten [85], Birget 
|20j . Gersten and Riley |90j ) that for every finitely presented group G the Dehn function 
/(n) and isodiametric function g{n) satisfy the following double exponential inequality: 

f[n) < a 

for some constants a, b. This immediately implies that the word problem in G is solvable if 
and only if the isodiametric function is recursive. It is still an open problem whether one 
can reduce the number of exponents to one. 

Problem 2.19 (D. Cohen [52], also attributed to Stallings). Is it true that for every finitely 
presented group G there exists a constant a > 1 such that 

f{n) < a^'^") 

where /, g are, respectively, the Dehn function and the isodiametric function of G? 

2.5. Filling Length functions and the space complexity. If G = (X | R), u; = 1 in 

G, then there is a sequence of transformations 

(4) —)• —)•...—)• 1 

where at every step we either insert a relation from R^^, insert or delete a subword of the 
form xx-'^, X £ X^^ ^50\. The area of w is equivalent to the minimal length of any such 
sequence. We can also estimate the space needed by a Turing machine in order to recognize 
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words which are equal to 1 in G by looking at the sizes of the intermediate words Wi. The 
minimum for all sequences ([4]) of the maximal length of Wi is called the filling length of w. 
One can follow Gromov [98] and define the filling length function FL(I) of a group using 
the filling length of words in the usual way. Gromov [99^ Page 100] noticed that in many 
finitely presented groups FL(/) is at most a constant factor of the isodiametric function of 
G, and asked if it is so for all finitely presented groups. 

FL is probably the most obvious way to define the space function for a group, but it 
turned out not to be the most natural function, or, more precisely, unlike in the case of Dehn 
functions, different natural definition of the space function give drastically different results. 
In particular, as suggested in Bridson- Riley [39], one can allow taking cyclic shifts of words 
and splitting a word Wi into a pair of words (uj , Vi) each equal to 1 in the group. The resulting 
function is called the fragmenting free filling length function of a group denoted FFFL(/) 
|39| or the space function of a group [183] . Both Gromov's filling length function FL and 
Bridson-Riley fragmented filling length function FFFL have nice geometric interpretation 
in terms of the transformation of van Kampen diagrams. Computing the filling length 
amounts to removing cells of the diagram one by one without changing the base point 
on the boundary and looking at the lengths of the boundaries of the resulting diagrams. 
When computing fragmenting free filling length function of a group, we are also allowed to 
change the base point, and divide a diagram into two subdiagrams (and then taking the 
sums of lengths of the boundaries of the pieces). It is proved in [39] that these functions 
behave differently for some finitely presented group G, for instance, the function FFFL can 
grow linearly while FL has exponential growth. Since cutting van Kampen diagrams is the 
main tool in studying them, the function FFFL seems to be the most natural group theory 
analog of the space function of a Turing machine (see Olshanskii |183j ). As in the case of 
Dehn functions, the function FFFL does not depend on the presentation of a group (up 
to equivalence), and for every finitely presented group G there exists a non-deterministic 
Turing machine recognizing the word problem in G whose space function is equivalent 
to FFFL (this "diagram eating" Turing machine is essentially described in [210^ Section 3, 
Proof of Theorem 1.1]). Thus the function FFFL is in general closer to the space complexity 
function of the word problem of a finitely presented group than FL. 

Note that similarly to Theorem 12. 17|, but easier, one can establish the following 

Proposition 2.20 (Olshanskii |183j ). The space function of a finitely presented group G 
is equivalent to the space function of a non- deterministic Turing machine. The language 
accepted by this machine coincides with the set of words equal to 1 in the group. 

2.6. Two examples of Dehn functions. 

2. 6. A. Two concrete groups. We shall give two standard examples of group presentations 
where the Dehn function is computed explicitly. 

Example 2.21. The Dehn function ofl? = (a, 6 | a6 = 6a) is at most quadratic (in fact it 
is exactly quadratic by, say, \2.S\ or \3.2.A\ but we shall prove only the upper bound). 

Example 2.22. The Dehn function of the Baumslag-Solitar group BS(1,2) = {a,b \ 
bab^^ = a^) is at least exponential (in fact it is exactly exponential by, say, J. Groves 
and S. Hermiller |104j . but we shall prove only the lower bound). 

2.6.B. HNN- extensions. Both examples are HNN-extensions (of the cyclic group). Recall 
[147j that if G is a group, A,B are isomorphic subgroups of G, and 0: A — )■ S is an 
isomorphism, then the HNN-extension of G with free letter t and associated subgroups 
A, B is the group 

HNN(G; A, B, <p) = {G,t\ tat'^ = (j){a) for every a £ A.) 
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That is the presentation of that group is obtained by adding letter t to the generating set, 
keeping the defining relations of G, and adding the HNN-relation tat~^ = 0(a) for every 
a G A. In fact it is enough to add HNN-relation for every generator of A. Hence if G 
is finitely presented, A,B are finitely generated, then HNN(G; A, 6, is finitely presented. 
We shall also use multiple HNN-extensions. These correspond to a group G, and a collection 
of pairs of isomorphic subgroups Ai,Bi, i = 1, . . . , n and isomorphisms (pi: Ai — )• Bi. The 
corresponding multiple HNN-extension of G has the following presentation 

HNN(G; {Ai), (B,), (0,)) = . . . | Uatr^ = c/),{a) 

for every a £ Ai,i = 1, . . . ,n) . 

A cell corresponding to an HNN-relation looks like a quadrangle (see Figure [8|) with 
edges labeled by ti on the opposite sides pointing the same way; a and (/>i(a) there are 
words representing the corresponding elements of Ai and Bi respectively. 




a 



Figure 8. An HNN-cell 



If A is a van Kampen diagram over this presentation, and vr is an HNN-cell inside it, 
then each edge of vr labeled by ti must be an edge of another HNN-cell, hence HNN-cells 
with tj-edges form ti-bands (also called ti- corridors), that is sequences of cells 7ri,...,7rfc 
where every two consecutive cells share a tj-edge. Bands are the main tools for studying 
van Kampen diagrams over multiple HNN-extensions. 

The following basic lemma was essentially proved in |163j . 

Lemma 2.23. Let A be a diagram over an HNN-extension presentation. Suppose that A 
is minimal, i.e. has the smallest number of HNN-cells among all diagrams with the same 
boundary label. Then no ti-band is an annulus (i.e. one of the ti-edges of the first cell in 
the band does not coincide with any other ti-edge of the band). 

Proof. Indeed, suppose a tj-band T is an annulus. Suppose for convenience that the tj-edges 
of that band point inside the annulus (the other case is similar). We can also assume that the 
subdiagram A' bound by the inner boundary component of T is a smallest (under inclusion) 
among all subdiagrams bounded by inner boundaries of tj-annuli. Then A' cannot contain 
any tj-annuli. Since the boundary of A' does not contain any tj-edges, it cannot contain any 
HNN-cells except for the cells from T. Hence, by the van Kampen lemma (more precisely 
by Proposition 12. 4p the boundary label of A' is equal to 1 in G. Since (pi is an isomorphism, 
then the label of the outer boundary component of T is equal to 1 in G. Hence we can 
replace the annulus T together with A' by a van Kampen diagram over G (by Lemma [2.3p . 
This reduces the number of HNN-cells in the diagram without changing the boundary label, 
a contradiction. □ 

Lemma 12.231 immediately implies the main property of HNN-extensions 

Corollary 2.24. The group G naturally embeds into HNN(G'; (Ai), (Bi), (ti)). 

Proof. Suppose that the natural homomorphism from G to the HNN-extension 

HNN(G; {Ai),{B,),{t.,)) 
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has a word w in the kernel. Then by Lemma 12.31 there exists a van Kampen diagram A 
over the presentation of HNN(G; [Ai), (Bi), (ti)) with boundary label w. We can assume 
that A is minimal. By Lemma 12.231 it does not contain tj-annuli. Hence every maximal 
tj-band in A must start and end on the boundary of A. But the boundary of A does not 
have tj-edges. Hence A does not have HNN-cells, and it is a diagram over the presentation 
of G. By Proposition 12.41 is 1 in G. Thus the kernel of the natural liomomorphism is 
trivial. □ 

2.6.C. Proofs of Examples [EM. [Ql Now Examples E^Il E^D are easy to prove. 

Example I2.21L The group I? can be considered as an HNN-extension in two different 
ways: we can consider a or 6 as a free letter (indeed ab = ba is equivalent to aba~^ = b 
and to bab~^ = a). Hence we can consider a-bands and 6-bands. Let A be a van Kampen 
diagram over the presentation of 7? with minimal number of cells among all diagrams with 
the same boundary label. It is easy to see (the proof is similar to the proof of Lemma 
I2.23P that every a-band can intersect every 6-band in A only once. Since there are no a- 
and 6-annuli in A (by Lemma l2.23p . every maximal a- and 6-band starts and ends on the 
boundary of A. Hence if I is the length of the boundary of A, then there are at most 
1/2 maximal a-bands and at most 1/2 maximal 6-bands in A. Since every cell in A is the 
intersection of an o-band and a 6-band, the area of A is at most P/A and the Dehn function 
is at most quadratic. 

Example 12.221 We are going to use the fact that the cyclic subgroup (a) is exponentially 
distorted in the Baumslag-Solitar group BS(1,2): b"'ab~^ = (so the element has 
linear in n length in BS(1,2) and exponential length in (a)). 

Consider a diagram A over the presentation of the Baumslag-Solitar group with boundary 
label ab~"'a~^b"'a~^b~^a~^b"' as on Figure [H For simplicity suppose that n is odd. Again 
we can assume that A has the smallest number of cells among all diagrams with the same 
boundary label. 



a 




Figure 9. A diagram with boundary label a6 "a ^6"a ^b "a ^6" over the 
presentation of the Baumslag-Solitar group BS(1,2). 



Since 6 is a free letter, we can consider 6-bands in A. The length of the boundary of 
A is 4n -|- 4. Consider the 6-band that starts at a 6-edge on one of the 4 sides labeled by 
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b"'. It must end on one of the other two sides labeled by 6" where the dh'ection of 6-edges 
is opposite (we trace the boundary counterclockwise). Since 6-bands do not intersect, the 
6-band B staring at the middle 6-edge (here we use the fact that n is odd) must end at the 
middle 6-edge of the other side of the diagram (see Figure [9]) . It is not difficult to show 
that the 6-band can only be horizontal (so only one of the two possibilities on Figure [9] can 
occur). Let u be the label of the shorter side p of the 6-band labeled by a power of a. Then 
the path p cuts off a subdiagram A' with boundary label 

5{n-l)/2„^-{n-l)/2^-l_ 

By Lemma l2.3t that word must be equal to 1 in BS(1, 2). Note that 

5{n-l)/2„^-{n-l)/2 ^ ^2(-i)/2 

in that group. Hence u = a^'" ^^^'^ . By Corollary 12.241 then u has length at least 2'^"'"^)/^. 
But that means that the length of B is at least 2^""^)/^, hence the area of A is at least 
2("~i)/2 while the perimeter is linear in n. Since A is a minimal diagram, the Dehn function 
of the BS(1, 2) is at least exponential. 

2.7. Isoperimetric functions of geodesic metric spaces. Different area functions. 

The notion of the area of a loop is well defined in the setting of Riemannian manifolds as 
well as in finitely presented groups (see for instance [371 Chapter 1, Section 8.1.4]). It also 
can be defined for geodesic metric spaces. In fact there are several kinds of definitions of 
area in that case. Below, we shall mention the definition of Bowditch [29], \28\ Sections 2.3, 
5], Gromov's definition from of coarse area function from \99\ 5.F] and a definition using 
metric currents [5l I234| [235j . In all these definitions (X, dist) is a geodesic metric space. By 
a loop in X we shall always assume a Lipschitz (hence rectifiable) map from the unit circle 
to X. Let Q be the set of all loops in X. Each definition of an area defines a function 
j4: — )• Then we define the corresponding isoperimetric function as the function A{1) 
such that A(l) = max{^(7) | 7 S 0, 17I < /}. Sometimes we shall call any function f >- A 
also an isoperimetric function of X. 

2.7. A. The definition of Bowditch. For every curve a: [0,1] — t- X, by —a we denote the 
curve a(l — t) (that is the same curve traced in the opposite direction). Let ao,ai,a2 be 
three Lipschitz curves [0, 1] — ?• X connecting points A.,B G X. Then we can form three 
loops Ui U (— aj+i) where i = 0, 1, 2, and +1 is understood modulo 3. Whenever three loops 
are obtained this way, one says that they form a 6-loop. Now let O be the set of all loops 
m Let A be a function from Q. to M+ satisfying the following two conditions 

(-Bi) [The triangle inequality for ^-loops.] For every three loops 71,72, 73 G forming a 

0-loop, we have ^(71) < ^(72) + ^(73)- 
{B2) [The quadrangle inequality.] Suppose that 7 G is split into 4 subpaths ai, 

03, 04. Then ^4(7) > kdid2 where di = dist(ai,a3) and d2 = dist(a2, 04), /c is a 

constant. 

2.7.B. The coarse definition of area function by Gromov. A 6-filling of a loop 7 is a pair 
consisting of a triangulation of the planar unit disk and of an injective map ^p from 
the set of vertices of the triangulation to X such that the restriction of ip to the points on 
dD^ = coincides with 7. The image of the map is called a filling disk ofj. We can join 
the images of the vertices of each triangle of the triangulation of by geodesies. If two 
of these vertices are on dD^, we replace the geodesic by the arc of 7(5'^) connecting these 

^In fact it is enough to assume that SI is a set of loops closed under the operation of cutting a loop into 
two loops by a geodesic segment connecting two points on the loop. Thus the area function can be defined 
even for some non-simply connected spaces. 
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points. Thus we obtain a finite number of triangles (some sides are geodesies, some sides 
are arcs in 7(5"^). These triangles are called bricks. The length of a brick is the sum of the 
lengths of its sides. The maximum of the lengths of the bricks in a partition is called the 
mesh of the partition. The partition is called (5-filling partition of 7 if its mesh is at most 
6. The corresponding filling disk is called 5-filling disk of 7. The J-area of 7 is the minimal 
number of triangles in a triangulation associated to a 5-filling partition of 7. We denote 
it by ^5(7). If no 5-filling partition of the loop 7 exists, we put ^5(7) = 00. Given the 
definition of a 5-area of a loop, we can naturally define the 5-isoperimetric function As{l) 
as the supremum of (5-areas of all loops of length at most /. Note that As{l) may be equal 
00 even if the ^5(7) < 00 for every 7. The corresponding isoperimetric function will be 
denoted by A^il). We shall always assume that our space X is ^-simply connected for some 
// > 0, that is A^{1) < 00 for every /. In this case all functions As, 6 > fj, are equivalent (see 
[72]). Note that if two metric spaces X and Y are quasi-isometric, and /^-simply connected, 
then their ^^-functions are equivalent for 5 > fi. 

Note that the coarse area function A^ satisfies the Bowditch conditions (-Bi), (^2)- 

2.7.C. The metric currents definition. This definition was used to obtain remarkable results 
in [234^ 1235] . Since it is more complicated than the other two definitions, we present only 
some ideas it is based on, referring the reader to [5l |43]. First let M be an n-dimensional 
Riemannian manifold. Let be a 2-dimensional disc in equipped with the Euclidean 
coordinate system, and / be a differentiable map from to M. Then for every x S we 
have two derivatives fx = (ai, . . . , a„) and fy = (61, ... , bn), vectors in the tangent space 
rj(3;)(M) corresponding to the two basic directions in . Then the Jacobian of / at x is 
the number 



(see [m Section 15]) Integrating J(x) over D gives the area of the map /. If / is not 
differentiable but only Lipschitz, then by the classical Rademacher's theorem (see [43]), / is 
differentiable almost everywhere with respect to the Lebesques measure, so J{x) is defined 
for almost all x, and we still can integrate J{x) over D. So areas are defined for Lipschitz 
maps too. This allows one to define a Lipschitz area function L{n) in every Riemannian 
manifold M. 

Now if M is not a Riemannian manifold but just a metric space, and / is a Lipschitz 
map from to M then in order to define an area of /, we need to use, for example, 
the Hausdorff measure (one can use other measures as well, such as the inner HausdorfF 
measure of Busemann |43j or mass* measure of Gromov [97\ Section 4.1]). Recall, that if E 
is a subset of M, then the 2-dimensional Hausdorff measure of E is, up to a scalar multiple, 
the limit as e — ?• of the infimum of sums H2{E) = ^diam(Sj)^ for all covers of E by 
countably many balls Ei of diameters < e. Then the area of the map / is the i72-nieasure of 
f{D'^). Note that for Riemannian manifolds M this definition coincides (up to a constant 
factor) with the previous definition. 

Unfortunately this definition behaves badly if we try to estimate area functions of limits 
of metric spaces, say, asymptotic cones: the area of a limit is not the limit of areas. In order 
to overcome that we need to further generalize the Lipschitz maps. 

A singular Lipschitz chain is a finite formal combination C = ruifi where is a 
Lipschitz map from some region in to the metric space X. The area of a chain C is 

?njArea(/i) (see [97]). The boundary of one Lipschitz map / : — )• X is the restriction 
of / to dD^. The boundary of a linear combination is the linear combination of boundaries 
which can be viewed as an element of the first homology group of X. Thus we can define 
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a (homological) area of a loop 7 as the smallest area of a singular Lipschitz chain with 
boundary 7Q 

One can introduce a metric on the space of all singular Lipschitz chains from 'S? to X. 
It can be, say, the flat metric (the definition goes back to Witney |236| ). similar to the 
well-known Gromov-Hausdorff metric |103| . Then one can complete the space of singular 
Lipschitz chains with respect to this metric and obtain the space of 2-dimensional inte- 
gral metric currents. The area of an integral metric current is the limit of areas of the 
corresponding singular Lipschitz chains. Thus one can view 2-dimensional integral metric 
currents as infinite linear combinations of Lipschitz maps from into X. 

2.8. Cayley complexes and metric spaces. Let V = {X | i?) be a presentation of a 
group G with X finite, X = X~^ . Consider the Cayley graph V = Tx{G) (i.e. G as the 
set of vertices and {{g,gx) \ x ^ X,g ^ G} as the set of edges. If we label every edge of F 
by the corresponding element x G X, we get a labeled Cayley graph. The graph is directed 
but each edge {g, gx) has the inverse {gx, x). Thus when we travel from g to gx, we read x, 
when we travel from gx to g, we read x~^. 

Every group word wm.X labels unique path in the labeled Cayley graph starting from any 
vertex g £ G. It ends at the vertex gw. In particular the equality w = 1 is true in G if and 
only if that path labels a loop starting from any vertex of G. For every r £ R, let us consider 
all loops 7 in F labeled by r and let us glue in a disc to F with dDy = 7. This way 
we obtain an (edge-labeled) CW-complex CF(P), the Cayley complex of G corresponding 
to the presentation V. Note that every simplicial loop in that complex (i.e. loop consisting 
of edges of F) is labeled by a group word W in X which is equal to 1 in G. By the van 
Kampen lemma, there exists a van Kampen diagram over V with boundary label W. That 
van Kampen diagram is a tesselated disc D^, and the labels of its edges give us a map from 
to the labeled graph CT{V) such that the image of dD^ is 7. This immediately implies 
that, 

• if P is finite then A^{1) for CT{V) is always finite for, say, fi = max{|r| | r G R}, 
i.e. GriV) //-simply connected; 

• the Dehn function of G is equivalent to As for every sufficiently large 6 and satisfies 
Bowditch's conditions (-Bi), (-B2) with ^ equal to the square of the maximal length 
of a defining relator. 

Notice the following important topological properties of the Cayley complex. 

(Ci) The Cayley complex is simply connected. 

(C2) The group presentation is finite if and only if the corresponding Cayley complex is 
locally compact. 

2.9. The Dehn functions of groups acting on metric spaces. Geometric group theory 
studies groups acting by isometries on metric spaces. We say that the action of G on X 
is geometric if it is co-compact, and properly discontinuous. Here co-compact means that 
there exists a compact subset Y C X such that X = G ■ Y; properly discontinuous means 
that for every point x £ X there exists a neighborhood U 3 x such that y ■ U OU = for 
every y £ G\{1}. For example the fundamental group of a compact Riemannian manifold 
M acts geometrically on the universal cover of M. 

It is easy to observe (see Gromov |99] ) that if a finitely presented group acts geometrically 
on a simply connected metric space with the coarse space function As (see l2.7.B]l . then the 
Dehn function of G is equivalent to Ag. 

Note that a homological version of the Dehn function of groups can be defined in a similar way |86j . 
The homological Dehn function is easier to compute, and it bounds the ordinary Dehn function from below. 
It has been studied, for example, in |15) . 
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For Riemannian manifolds, the fact that the Riemannian area function (see I2.7.CP is 
equivalent to As is not obvious, although it is a part of "folklore". It was proved in all 
details by Bridson in |35j . 

Theorem 2.25 (Bridson |35j). If a finitely presented group acts geometrically on a simply 
connected Riemannian manifold, then the Dehn function of the group and the corresponding 
filling function of the manifold are equivalent. 

Since 1? acts geometrically on the Euclidean plane by translations, and, as we learned 
in Real Analysis (and ancient Greeks knew without any Real Analysis), the area function 
of is we finally deduce that the Dehn function of 1? is quadratic. 

2.10. Asymptotic cones of metric spaces and groups. 

2. 10. A. The definition. Let X be a metric space with distance function dist, say, a Cayley 
graph of a finitely generated group. Consider a non-decreasing sequence of numbers d = (dj) 
with lim di = oo, called scaling constants and the sequence of metric spaces X/di where W/n 
denotes the space W with distance function (dist is the distance function in W). We 
want to define a limit of this sequence of spaces. One of the reasons is that we want to 
"forget" about local properties of X in favor of global properties. In particular, we want 
to study properties of configurations of points in X which are satisfied by configurations 
with arbitrary large pairwise distances between points, and do not want to be disturbed by 
properties which only hold for configurations of small diameter. The limit space is supposed 
to have the global properties of X while the local properties may disappear. One way to 
define a limit was proposed by Gromov in [96]. The limit is called the Gromov-Hausdorff 
limit but that definition can only be applied when the spaces X/di are uniformly locally 
compact which is true in the case of Cayley graphs of groups of polynomial growth but not 
true in general. A much more universal definition (which coincides with Gromov's when 
his definition applies) was given by van der Dries and Wilkie [229j . The limit is called an 
asymptotic cone of X and is defined as follows. In fact we shall define a limit of any sequence 
of spaces (Xj, distj). As usual for definitions of boundaries and completions of different sorts, 
the limit will consist of sequences {xi), Xi S Xj. Thus consider the direct product Z = liXi. 
The most obvious way to define a distance function on Z is coordinate- wise: 

dist((a;i), {vi)) = lim dist j(xi, y^). 

The problem is that the limit may not exist (we ignore the issue that the limit can be oo 
for now). To circumvent this problem, let us pick a non-principle ultrafilter on N, that is 
a set u of substsets of N which is closed under intersections, super-sets, does not contain 
finite subsets, and with every set C/ C N either [/ or N \ C/ is in tj. The sets from lo are 
called hig, the other sets are small. We can also view u as a. finitely additive measure on 
the set of all subsets of N which has only two values 0, 1, and uj{H) = 1 while a;(0) = 0. If 
Ai is a sequence of statements and Ai holds for all i from a big subset, we say that Ai holds 
uj- almost surely. 

To prove the existence of ultrafilters one needs the Axiom of Choice (or, more precisely, 
a slightly weaker hypothesis, the so called Boolean prime ideal theorem [57j), so although 
we believe that ultrafilters exist, one cannot actually define an explicit ultrafilter using only 
axioms of Zermelo and Frenkel. Anyway, as soon as we fix an ultrafilter we can modify 
the definition of the limit of a sequence of numbers, introducing the definition of the w-limit. 
It is almost exactly the same as the standard calculus definition: if hi is a sequence of real 
numbers, then 

\mi^{hi) = h means that for every e > the set of indices i such that |6j — 6| < e is big. 
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So the only difference with the standard definition of a hmit is that we replaced the words 
"for almost all indices i" by "the set of indices i is big". The fact that uj is an ultrafilter 
immediately implies - exercise - that any sequence of numbers has unique w-limit (it may 
be equal to ±00). For example the limit of the sequence 1, 0, 1, 0, 1, . . . is if the set of even 
numbers is big, and 1 otherwise. 

Now we have a "distance" function on the set HXi: dist{{xi), {yi)) = lim*^ distj(3;j, yj). 
Clearly the triangle inequality is satisfied. Since this function can take infinite values, we 
consider a "connected component". Namely, pick a point o = (oj) in HXi, and let HbXi be 
the set of all points at finite distance from o. This restriction of the "distance function" to 
HbXi is a quasi-distance, because different points can be at distance 0. For example, if two 
points (xj), (yi) are such that Xi = yi for all i from a big set, then the distance between 
these points is 0. The relation {xi) ~ (j/j) iff dist((xi), (y,)) = is an equivalence relation 
and the quotient IlfeXj/ ~ is a metric space with the induced metric. That metric space is 
called the uj-limit of Xi corresponding to the observation point (oj). 

Definition 2.26. The asymptotic cone of a metric space X corresponding to an ultrafilter 
w, an observation point o = (oi), Oi G X, and a sequence di as above is the a;-limit of spaces 
X/di corresponding to the observation point 0. It is denoted by Con'^(X, {dn),o). 

Asymptotic cones of a group endowed with some metric (say, a finitely generated group 
with the word metri^ or a Lie group with Riemanian metric) are asymptotic cones of the 
corresponding metric space. 

2.10.B. Some properties of asymptotic cones. Here are some basic properties of asymptotic 
cones that can be found in [229], [99], [72] and other papers. Some of these properties are 
quite easy to prove, for other properties we provide references. 

(1) If the metric space X is homogeneous (say, X is the Cayley graph of a group), then 
asymptotic cones of X do not depend on the choice of the observation point (so we 
shall always assume, if X is a Cayley graph, that the observation point is (e) and 
we shall omit it from the notation for an asymptotic cone of a group). 

(2) If two metric spaces X and Y are quasi-isometric, then their asymptotic cones corre- 
sponding to the same ultrafilters, same scaling constants and appropriately chosen 
observation points are bi-Lipschitz equivalent, hence homeomorphic. In particu- 
lar, the asymptotic cones (with the same parameters) corresponding to two Cayley 
graphs of the same finitely generated group but different finite generating sets are 
bi-Lipschitz equivalent. 

(3) For every group G the set H^G is a group (with coordinate-wise multiplication) 
acting transitively on any asymptotic cone Con'^(G, {di)) by left multiplication. 

(4) Any asymptotic cone is a complete metric space. 

(5) For every sequence of paths pi : [0, li] — )■ X parameterized by their lengths (/j > 0), 
the function p: [0, lim^/j] — )• Con'^(X, {di), (oj)) (where dist{oi,pi{0)) = 0{di)) is a 
path parameterized by its length. Moreover if pj is an (^, 5i)-quasi-geodesic (i.e. it 
is a (A, i?j)-quasi-geodesic map from the interval onto its image), Bi = o{di), or an 
A-Lipschitz path, then p is an A-bi-Lipschitz path or A-Lipschitz path respectively 
(i.e. an A-bi-Lipschitz or an A-Lipschitz map from an interval to X). 

(6) In particular, the w-limit of every sequence of geodesic paths in X is a geodesic path 
in C = Con'^(X; {di), (oj)) (it can be infinite or a point or empty). It is not true that 
every geodesic path in C is an w-limit of geodesic paths in X but if g is a geodesic 
in C, then for every e > there exists a piecewise geodesic path g' with k = k{e) 
geodesic pieces that is an (x!-limit of fc-piecewise geodesic paths in X, and q' (resp 



'I.e. the metric on the Cayley graph corresponding to a finite generating set. 
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g) is in e-neighborhood of q (resp. g') [71], i.e. the Hausdorff distance between g, g' 
is at most e. 

(7) Suppose that C = Con'^(X; (dj), (oj)) contains a simple geodesic triangle T. Then 
for every e, there exists a, k = k{e) and a sequence of geodesic fc-gons Ilj in X such 
that n = lim"^ Ilj is at Hausdorff distance at most e from T, contains the midpoints 
of the three sides of T, and each Ilj is thick, that is the middle third of each side of 
Ilj is far from the union of the other sides, and every vertex of Ilj is far from the 
vertices not adjacent to it (a precise definition of thick geodesic polygons can be 
found in [74]). 

(8) An asymptotic cone of a finitely generated group may depend on the ultrafilter, and 
the scaling constants (dj) \12S\ I14(H I74|. If the Continuum Hypothesis is false, then 
there exists a group (an uniform lattice in SLn(]R)) with the set of non-homeomorphic 
asymptotic cones 2^*^° . On the other hand if the Continuum Hypothesis is true then 
the set of all asymptotic cones of all countable metric spaces is of cardinality contin- 
uum |140| . and there exists one finitely generated (and recursively presented) group 
with continuum pairwise non-vri-equivalent (hence non-homeomorphic) asymptotic 
cones [73]. There are also finitely presented groups with 2 non-vri-equivalent asymp- 
totic cones |193j . 

2.11. Asymptotic cones and Dehn functions. We have mentioned that asymptotic 
cones of finitely generated groups capture asymptotic properties of groups, i.e. properties 
that manifest themselves on configurations of elements of arbitrary large diameters. For 
example, working with hyperbolic groups (that is groups all whose asymptotic cones are 
M-trees (see I3.1.AP it is convenient to remember that configurations of elements with large 
diameter in a hyperbolic group behave like points on a tree with similar pairwise distances. 
In particular, asymptotic cones of a group reflect the properties of the Dehn function. 

Theorem 2.27 (Gromov [99]). Suppose that all asymptotic cones of a group G are simply 
connected. Then G is finitely presented, has polynomial isoperimetric function and linear 
isodiametric function. 

Proof. The proof of this remarkable statement is very easy. Indeed, if, say, the Dehn 
function / of G grows faster than any polynomial, then for every k > 1 f satisfies inequality 
/(n) > kf{n/2) for infinitely many n. For each k let 7^ be a loop in the Cayley complex of G 
such that Area(7fe) > fc/(|7fc|/2). Let dk be the length of 7^. For some ultrafilter cj, consider 
the asymptotic cone C = Con'^{G, (dk)), and the w-limit 7 of 7^ in C. Then 7 has length 
1. Since C is simply connected by assumption, there exists a continuous map 
with d{g) = 7. Since (p is continuous, and is compact, (p is uniformly continuous. Hence 
there exists an e > and a decomposition of into, say, n triangles of Ai, . . . , A„ such 
that the perimeter of each (pi^i) is at most 1/3. By Property [6] of asymptotic cones from 
I2.10.B| it follows, that we can assume that the sides of all curvy triangles 4>{Ai) except 
for the sides contained in g{dD'^), are w-limits of geodesies from the Cayley graph of G. 
Therefore each (^(Aj) is an w-limit of some sequence of loops (5J^, m = 1, . . ., in the Cayley 
graph. But that means a disc bounded by the loop 7^ can be decomposed into n loops 
(5^ of length < dm/'i + o{dm) w-almost surely for all m > 1. Hence Area(7m) < nf{dm/2) 
cj-almost surely for all m, a contradiction. □ 

Riley |203| proved that under the assumptions of Theorem 12.271 the group also has linear 
FL function (and hence linear FFFL function as well). 

The converse statement of Theorem 12.271 is not true. In [192] . we showed that the group 
(a multiple HNN-extension of a free group) 

{a,b,ti,t2 I tiat~^ = ab,tib = bti,i = 1,2) 
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has cubic Dehn function, linear isodiametric function and non-simply connected asymptotic 
cones. The "cubic" in that statement can be improved to as low as n^logn by modifying 
the group using the method from |191j . It is impossible, though, to have an example with 
quadratic Dehn function. 

Theorem 2.28 (Papasoglu |196| ). All asymptotic cones of a finitely presented group having 
quadratic Dehn function are simply connected. 

As Theorem 12.271 this is a very important fact, but the idea of the proof is not that 
difficult. We take a van Kampen diagram A over the presentation of our group G, and start 
removing layers of cells one by one from the outside in, peeling the diagram as an apple, so 
that each layer is 1-cell wide. The fact that the area is quadratic lets us estimate the number 
of layers and their lengths. This allows us to decompose the diagram into a constant, say, c, 
number of subdiagrams whose perimeters are at most half of the perimeter of A. That, in 
turn, implies that every loop 7 in any asymptotic cone of G can be decomposed into 0(1) 
loops of length |7/2|. This implies simple connectivity of the asymptotic cone. 

An analysis of Gromov's proof of Theorem 12.271 allowed Papasoglu to obtain the following 
useful result. 

Theorem 2.29 (Papasoglu [73]). Suppose that all asymptotic cones of G are simply con- 
nected and have Gromov coarse area function As{l) bounded by Then the Dehn function 
of G is bounded by n^"*"^ for every e > 0. 

That result was used by Drutu in [73] (see also I3.2.A1 and H.l.Al below). 

Note that Theorem 12.281 does not give metric properties of asymptotic cones, only the 
topological property of being simply connected. In a simply connected asymptotic cone 
even of a nice nilpotent group, a Lipschitz loop does not necessarily bound a Lipschitz 
2-disc or even an integral 2-current (it can be deduced from [6] for the Heisenberg group 
i?3, for other examples see Wenger [235], and l3.2.C]l . But for groups with quadratic Dehn 
functions, the asymptotic cones are much nicer. 

Theorem 2.30 (Wenger |235| ). Let G be a finitely presented group with quadratic Dehn 
function. Then every asymptotic cone of G admits a quadratic isoperimetric inequality in 
terms of 2-currents, that is for some constant C , every Lipschitz loop 7 bounds an integral 
2-current of area < C\j\'^. 



3.1. Hyperbolic groups and asymptotic cones. 

3.1. A. The definition and a characterization. There are several equivalent definitions of 5- 
hyperbolic (in other terminology, word hyperbolic) metric spaces and groups [98j . Rips' 
definition is the following. 

Definition 3.1. A geodesic metric space X is 5-hyperbolic for some 6 >0 if every geodesic 
triangl^ in X is 6-thin, that is every side of it is in the closed (^-neighborhood of the union 
of the other two sides. A finitely generated group G is 5-hyperbolic if its Cayley graph 
is 5-hyperbolic. A geodesic metric space (a finitely generated group) is hyperbolic if it is 
5-hyperbolic for some 5 > 0. 

This property is obviously invariant under quasi-isometry, so, in particular, hyperbolicity 
of a finitely generated group does not depend on the choice of a (finite) generating set. 
Note that if X is (5-hyperbolic, then for every real d > we have that X/d is 5/d- 



hyperbolic. This and Properties [6t 171 from 12. 10. Bl immediately imply that every asymptotic 



3. Isoperimetric functions of important groups 



That is a triangle where aU three sides are geodesies. 
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cone of a hyperbolic geodesic metric space is 0-hyperbolic. Hence in every geodesic triangle 
in the asymptotic cone every side is contained in the union of the other two sides. This 
means that every asymptotic cone of a hyperbolic geodesic metric space is an M-iree. Thus 
all asymptotic cones of a hyperbolic group are complete M-trees. 
The converse statement is true also. Thus we have 

Theorem 3.2 (Gromov [981 [99]). ^ finitely generated group G is hyperbolic if and only if 
every asymptotic cone of G is an M-iree. 

It turns out that the M-trees that can appear as asymptotic cones of groups are of three 
kinds only because of the following recent theorem of Sisto. 

Theorem 3.3. (Sisto |220[ Corollary 5.9]). If an R-tree T is an asymptotic cone of a group, 
then it is a point, a line or an M-tree with valency 2^^ a,t each point (i.e. removing every 
point of the tree separates the tree into 2*^^^ connected components) . 

Remark 3.4. In view of |169[ 11591 l^^j. by Theorem 13.31 there are 3 possible isometry types 
of M-trees appearing as asymptotic cones of groups: the point, the line, the tree with valency 
2^° at every point. 

Remark 3.5. By [741 Proposition 6.1], an asymptotic cone of a group is a line if and only 
if the group is virtually cyclic^. It is obvious that an asymptotic cone is a point if and only 
if the group is finite. 

3.I.B. First examples of hyperbolic groups. Finite and free groups, and more generally free 
products of finite and cyclic groups are examples of hyperbolic groups. 

The classical (motivating) examples of hyperbolic groups are fundamental groups of com- 
pact negatively curved Riemannian manifolds. In particular co-compact lattices in Lie 
groups S'0„^i(M) are hyperbolic because these are fundamental groups of compact Rie- 
mannian manifolds of constant negative curvature. Hence if the genus of a closed oriented 
surface Sg is at least 2, then the fundamental group of Sg is hyperbolic. It has the presen- 
tation 

...,Xg,yg \ [xi.yi] . . . [xg,yg\ = 1) 

found by Dehn. It is easy to see that no two cyclic shifts of the relator or its inverse have a 
common prefix of length more than 1. Thus this presentation satisfies the small cancelation 
condition G'{j^). 

3. I.e. The small cancelation conditions. 

Definition 3.6. Let A > 0, and let V = {X \ R) he a group presentation. We say that V 
satisfies the condition C"(A) if the length of any common prefix of any two different cyclic 
shifts of words ri,r2 G R^^ (called a piece) is strictly less than Amin(|ri|, |r2|) (recall that 
all words in R are cyclically reduced by our assumption (see 12.1. Ah , so cyclic shifts of these 
words and their inverses are reduced words). 

The main property of C"(A)-presentations is the following Greendlinger Lemma [147] . 
proved by Dehn in the case of 7ri(S'g), g>2. 

Theorem 3.7 (See Lyndon, Schupp [147j ). Let V = {X \ R) he a group presentation 
satisfying C'{\), A < 1/6, then every minimal van Kampen diagram A over V either has 
no cells or has a cell vr with 5(7r) = uv with \u\ > \v\, and u is a subpath of dA (this cell is 
called a Greendlinger cell^. 



Recall that a group possesses a property P virtually if it has a finite index subgroup possessing P. 
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Theorem 13.71 implies that in every group where the (finite) presentation V satisfies C"(^) 
the word problem can be solved by the following Dehn algorithm. Take a reduced word w. 
If a cyclic shift w' of w contains more than a half of a cyclic shift of a defining relation r 
or its inverse (r' = uv, w' = uw", \u\ > \v\), then replace uw" by v~^w" and reduce the 
resulting word. The new words is equal to 1 modulo V if and only \iw = l modulo V. Since 
v~^w" is shorter than w, we can continue until we either get the empty word (in which case 
we conclude that w = 1 modulo V) or we get a non-empty word for which the shortening 
is no longer possible (in which case we conclude that w ^ 1 modulo V). 

Note that every group possessing a finite presentation for which Dehn's algorithm works 
(we shall call this a Dehn presentation) has linear Dehn function because removing Greend- 
linger cells from a minimal van Kampen diagram reduces the length of the boundary (hence 
the area of the diagram cannot exceed the perimeter). 

3.I.D. The linear isoperimeric inequality. Gromov proved that every hyperbolic group has 
linear Dehn function and in fact possesses a finite Dehn presentation. Here is a relatively 
short proof of this fact using asymptotic cones. 

Theorem 3.8 (Gromov [98]). Every hyperbolic group is finitely presented and has a finite 
Dehn presentation, and hence linear Dehn function. (Thus a group is hyperbolic if and only 
if it has a finite Dehn presentation.) 

Proof. We present here a proof that illustrates the usefullness of asymptotic cones. Let G 
be a hyperbolic group. We need the following statement first. A path 7: [0, — )• F in the 
Cayley graph F of a group G is called a bump if the the length I of the path is at least 3 
times bigger than the distance between its end points. 

Lemma 3.9. There exists a constant C such that every bump of length I > G contains a 
sub-bump of length at most ^l. 

Proof. Suppose that such a constant does not exist. Then for every n = 1,2, . . . there exists 
a bump 7„ of length dn > n which does not contain sub-bumps of length < (i„/3. Pick an 
ultrafilter lo and consider the asymptotic cone C = Con'^(G, (di)). The space C contains the 
(x!-limit 7 of paths 7^. Then by Property ([5|) of asymptotic cones from 12. 10. Bl 7 is a path of 
length 1 and the distance between its endpoints is at most 1/3. Recall that C is an M-tree 
(Theorem 13. 2p . A path of length 1 on a tree with enpoints at distance < 1/3 must have a 
subpath of arbitrary small (but non-zero!) length which is a loop. Let a be that loop of 
length at most j, a he the beginning (= the end) point of that loop. Then a is a double 
point on 7: o = ^{p) = 7(p -|- e) where £ < j- That means the distance between ji{p) and 
7i(p-|-e) is o{di) while the length of the subpath 7j between p and p + e is 0{di) and at most 
\di + o{di) < ^di (^-almost surely. Therefore for infinitely many values of i the subpath 
ji{p,p -|- e) is a bump of length at most ^di, a contradiction. □ 

Now Theorem 13.81 can be deduced as follows. Let G be a hyperbolic group. Let G be the 
constant from Lemma 13. 9[ Let R be the (finite) set of all words in the generators X oi G 
which are of length < 2G and equal to 1 in G. Lemma 13.91 immediatelv implies that every 
loop in the Cayley graph of G has a subpath that is a bump of length at most G (note that 
a loop is a bump too). The bump together with a geodesic connecting its endpoints is a 
loop of length at most G + ^G < 2G. Therefore every group word in the alphabet X that 
is equal to 1 in G contains more than a half (in fact, at least |) of a relator from R. This 
implies that {X | i?) is a finite presentation of G satisfying the Dehn property. □ 

The converse statement for Theorem 13.81 is also true and well-known: 
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Theorem 3.10 (Gromov [98]). A finitely generated group is hyperbolic if and only if it has 
linear isoperimetric inequality. 

Exercise 3.11. Prove the "if" part of Theorem \3.10\ using asymptotic cones. (Hint: use 
Property from \2.10.Bl ) 

Exercise 3.12. Prove all other basic properties of hyperbolic groups from |98j using asymp- 
totic cones. (Asymptotic cones do not give precise values of constants so you should replace 
estimates like 400^500 by "some constant" .) 

3. I.E. Equations in hyperbolic groups. Several other algorithmic problems have nice solu- 
tions in hyperbolic groups too. For example, Theorem 12.111 about solvability of quadratic 
equations is true if we replace there "free" by "hyperbolic" |174j . All solutions of quadratic 
equations without coefficients in a hyperbolic group can be described by the following 

Theorem 3.13 (Lysenok |148] ). Let S be a closed surface. For every hyperbolic group G 
there exist only finitely many homomorphisms of'Ki{S) to G up to the action by the mapping 
class group of tti{S) and the action o/Aut(G) on G. 

The proof of this theorem is relatively elementary comparing with the similar looking 
Theorem ElSl 

An analog of Makanin's Theorem 12.151 for free groups is also true for torsion- free hyper- 
bolic groups in general. 

Theorem 3.14 (Rips, Sela |205j ). There exists an algorithm to check whether a system of 
equations over a given torsion-free hyperbolic group has a solution. 

3.I.F. The language theory point of view. A language is a set of words in a finite alphabet 
X. An automaton is a finite labeled directed graph (labels are taken from a finite alphabet 
X) with two distinguished subsets of vertices S (start) and F (finish) |50j . For example, 
any finite subgraph of a labeled Cayley graph becomes an automaton if we choose the start 
and finish vertices. A language L is recognized by an automaton M if it consists of all words 
that can be read on M starting from a start vertex and ending at a finish vertex. Such 
languages are called regular. 

It is easy to see that the language of all group words in generators of a finitely generated 
group that are equal to 1 in that group (let us denote it by W{G)) is regular if and only 
if the group is finite. For a bigger class of context-free languages, i.e. languages recognized 
by more advanced pushdown automata (for the definition see |125j ). the situation is already 
different as shown by the following non-trivial theorem. 

Theorem 3.15 (Muller, Schupp [166] ). Let G be a finitely generated group. G is free if and 
only if W{G) is context-free and G is torsion-free. G is virtually free if and only if W{G) 
is context-free and accessible. 

Recall that a group G is called inaccessible if it contains an infinite sequence of subgroups 
G > Gi > G2 > . . . and a sequence of actions by isometries on simplicial tree£3 Tj such that 
Gi is the stabilizer of a vertex in Tj and all stabilizers of edges are finite. An inaccessible 
finitely generated group was constructed by Dunwoody [75]. A group is accessible if it is 
not inaccessible. All finitely presented groups are accessible |76j . 

By Holt |123j . the word problem in every hyperbolic group is real time recognizable, which, 
informally, means that it can be recognized by the Turing machine that reads its input at 
a constant rate, and stops at the end of the input (regular and context-free languages are 
real time recognizable). This implies, in particular, 



A simplicial tree is just a loopless connected graph in the sense of Serre [218| . 
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Theorem 3.16 (See, for example, Holt [123] ). The word problem in every hyperbolic group 
can be solved in linear time by a (multi-tape) Turing machine. 

Note that the fact that the Dehn function is linear directly implies only that the word 
problem can be recognized in linear time by a non-deterministic Turing machine. The 
proof of Theorem 13.161 uses the fact that every hyperbolic group has a Dehn presentation 
(Theorem 13. 8p . In |124j . Holt and Rees used the fact that nilpotent and some relatively 
hyperbolic groups have so-called generalized Dehn presentations introduced earlier by Can- 
non, Goodman and Shapiro. They prove that groups from these classes also have real time 
recognizable word problem. 

For more information about groups with word problem from some more complicated 
language classes see Oilman |9T| . 

3.I.G. The isomorphism problem for hyperbolic groups. The isomorphism problem for hy- 
perbolic group^^ is decidable. This is a very strong theorem proved in the torsion-free case 
by Sela (see j217j for a weaker result, and j63j by Dahmani and D. Groves for the full proof) 
and in the general case by Dahmani and Guirardel |65j . 

Here we present a short description of the solution of isomorphism problem which is a 
somewhat modified text sent to us by Prangois Dahmani. 

Let G = {X I ri, . . . , r^) and G' = {X' \ r^, . . . , r^) be two hyperbolic groups. Suppose 
first that every action of each of the groups G, G' on a simplicial tree is trivial, i.e. fixes 
a point. Every homomorphism (j): G ^ G' induces an action of G on the Cayley graph of 
G': a g = (f){a)g. Let d^f, be the maximal number such that every vertex of the Cayley 
graph of G' is moved by at least by some generator of G under this action. It is easy 
to check that if a sequence of homomorphisms (pi, (j)2, ■ ■ ■ consists of pairwise non-conjugate 
in G' homomorphisms, then the sequence of numbers d^. is unbounded, hence G acts non- 
trivially on the asymptotic cone of G' corresponding to the scaling sequence (d^.) which 
is an R-tree by Theorem 13.21 In fact one can deduce (as in Sela [217^ Theorem 9.1] using 
Rips' theory of groups acting on M-trees and its generalizations) that in this case G acts 
non-trivially on a simplicial tree as well. Hence we can assume that any set of pairwise 
non-conjugate homomorphisms G — )• C and C — t- G is finite. 

Therefore in order to decide whether G and G' are isomorphic we need to compute a finite 
list of homomorphisms containing a representative of each of the homomorphisms G — >• G' 
(up to conjugacy), and a similar finite list of homomorphisms G' — >• G. Assuming that this 
can be done, one checks whether a composition of a homomorphism from the first (second) 
list with a homomorphism from the second (first) list is conjugate to the identity map, that 
is whether the images of the generators of G (of G') under this composition are conjugate 
to the generators of G (of G') by the same conjugator. It is a system of quadratic equations 
(with one variable - the conjugator) and can be solved essentially as one quadratic equation 
(see I3.1.EP , or using the general technique by Rips and Sela |205j . 

There is an obvious infinite procedure to write down these lists: check all maps X — )• G' 
and X' — )• G one by one, and verify which maps induce homomorphisms. The problem is 
to understand when to stop: that is when a list contains representatives of all conjugacy 
classes of homomorphisms. For this, the key idea is to solve systems of equations. Indeed, 
morphisms G — )■ G' are in 1:1 correspondence with solutions in G' of the system of equations 

f n = i 

[ rm = l 



This problem has as the input two presentations of groups that are known to be hyperbohc, and asks 
whether the groups are isomorphic 
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with generators of G viewed as unknowns. Adding certain inequations (of the form u; 7^ 1), 
one can encode the property of being different from homomorphisms that are akeady on the 
Ust. Thus we can easily write a system of equations and inequations that has no solution if 
and only if we have collected a finite list containing representatives of all homomorphisms. 
The difficulty here is that we want not only that the solution be different from the recorded 
morphisms, but we want that it is not conjugated to any of them. For that, we record only 
homomorphisms G — )■ C that are short meaning that one cannot shorten the total length 
of the images of x G X by "obvious" conjugations in G' (such as cyclic shifts). The key 
points are that, first, shortness can be interpreted as the membership in a regular language 
(we view a homomorphism cj): G ^ G' as one large word containing all words (j){x),x S X 
separated by special symbols); that membership is called a regular constraint, and, second, 
that the number of short homomorphisms is finite. The fact that some kind of shortness 
of a homomorphism can be recognized by an automaton is similar to Lemma 13.91 above 
which characterizes words in a hyperbolic group without arbitrary bumps as words which 
do not contain bumps of at most constant length. It is easy to see that for every finite 
set of words W, the set of all words in a finite alphabet that do not contain subwords 
from W is regular. Thus the certificate for completeness of our list of homomorphisms is 
unsolvability of the system of equations and inequations subject to the regular constraint 
described above. Thus one needs to generalize Makanin's Theorem 12 . 1 5 1 and Rips and Sela's 
Theorem 13. 141 to systems of equations and inequations subject to regular constraints in not 
necessarily torsion-free groups. This has been done in a long series of papers including 
[T52l[2T5l [70l[62l[Ml[Ti3]. 

If the groups G, G' may act on simplicial trees without fixed points, the strategy is to 
find a canonical action on a tree where the vertex stabilizers are in a sense the smallest 
possible for both G and G' (the so-called J S J- decomposition introduced by Rips and Sela 
|204j ). and then study vertex stabilizers as in the previous paragraph. 

3.I.H. More examples of hyperbolic groups. As we have seen, every group having a presen- 
tation with the small cancelation condition C"(l/6) is hyperbolic (see l3.1.Dh . Moreover 
almost every finitely presented group is hyperbolic |98j . That statement was proved in 
[98l [TH 11761 1102j , but although the statements proved there have basically the same formu- 
lations, the results are different because of different meanings of the words "almost every", 
or, more precisely, different choices of the probabilistic model. 

3.1. HI. Few relator model. 

In the few relator model (also known as the asymptotic density model or Arzhantseva- 
Olshanskii model introduced by Gromov in [98] and Olshanskii in late 80s, we choose an 
alphabet of generators A and a number of relations k, and then for every n > 1 choose k 
group words ri, . . . ,rk of length < n in the alphabet A, and consider the group given by 
the presentation {A \ ri = 1, . . . ,rk = 1). Let N{n) be the number of all presentations 
we get this way, and A'h(ra) be the number of presentations of hyperbolic groups among 
them. Then the quotient -^^^ tends to 1 very rapidly with n (see Arzhantseva-Olshanskii 
[llj). In fact, as proved in [11], in the few relator model, we can count even the number of 
presentations with small cancelation C(A) for any fixed A > 0, and the quotient will tend 
to 1 almost as rapidly. Note that in [11], Arzhantseva and Olshanskii proved that almost 
all groups having presentations with m generators and k relators have the property that 
all their (m — l)-generator subgroups are free (that strengthened a previous result of Guba 
about 2-generated subgroups of 1-related groups with at least 4 generators [108] ) . The same 
or similar models were used by Arzhantseva and Champetier [3 [8l [9l Wl\ HH] where other 
generic properties of groups in the few relator model were found. 
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3.1.H2. Different lengths model. 

If we allow lengths of different relators satisfy different length restrictions, we obtain a 
stronger theorem. 

Theorem 3.17 (Formulated by Gromov [98], proved by Olshanskii |176j ). Let A he an al- 
phabet, \A\ = k> 2. Let d>0 and let (ni, • • • ,nd) be an increasing sequence of positive in- 
tegers. Let N = N{k, i,ni, . . . , Ud) be the number of group presentations G = (^|ri, • • • , r^) 
such that ri , • • • , r^^ are reduced group group words in the alphabet A such that the length 
of rj is Hj for j = 1,2, . . . ,i. If A^h is the number of infinite hyperbolic groups in this 
collection, then for every e > there exists n = n{e, d) such that for every choice of 

k,ni > n,ni < . . . < the quotient ^(^1^'^^' " ^^^ exceeds 1 — e. 

The probabilistic model used in this theorem is very different from the few relator model. 
In particular, the probability of having a small cancelation presentation tends to in that 
model (say, if we assume that w expni). 

3.1.H3. Gromov 's random groups model. 

If we also allow varying the number of relations d, we obtain one of the probabilistic 
models used by Gromov in constructing his random groups |1U2| (see also Gromov (SU] and 
Arzhantseva-Delzant [TO] ) . 

Here is the definition of the Gromov's random groups model. 

Let r = (V, E) be an oriented graph and X be a finite set. A labelling of F is a map 
a: E ^ XU X~^. Let W be the normal subgroup of the free group -7ri(F,p) (where p ^ V) 
generated by all words that label a loop in F based at p. Let Ga be the factor-group 



This map obviously defines a homomorphism q* from the free group 7r]^(F,p) to Gq^, 
where p is a vertex in V. The kernel of that map is generated (as a normal subgroup) by all 
words that label loops of F. Note that by construction there exists a natural homomorphism 
a+ from the graph F to the Cayley graph of G induced by a. The general questions to 
ask about G is whether G is infinite and hyperbolic, whether a"*" is "almost an isometric 
embedding" , and whether a* is injective on a large enough ball in the Cayley graph of the 
free group 7ri(F,p). The set of all labellings {X U X~^)^ has a natural uniform probability 
measure, and a random choice of a labelling a gives a random group G^. 

In particular, if F is a disjoint union of finite number of circles of lengths ni, . . . , n^, we 
get a random model similar to the different lengths model only the number of relations may 
be big comparing to the lengths of them. In fact, if the total number of possible labelings 
is exp Gn, where n is the maximal length of a cycle, for some constant G, and rij do not 
differ very much from n, then d can be chosen as large as exp aGn for some < a < 1. The 
number a is called the density. If the density is > 1/2, then the random group is trivial 
with probability close to 1 (assuming that n — t- oo), but if q < ^, then the group is infinite 
and non-virtually cyclic hyperbolic with probability tending to 1 as n — >• oo. 

It is important to note that, as with the case of small cancelation presentations, we can 
impose the random relations not only on the free group, but on any non-virtually cyclic 
hyperbolic group. Thus if Fj is a sequence of graphs, we obtain a sequence of random 
non-virtually cyclic hyperbolic groups and surjective homomorphisms 



similarly to lS.l.JI 

Particularly strong results are obtained in |102j when the graph F runs over an expander 
sequences of graphs, that is /c-regular finite graphs having the following property: the 
second highest eigenvalues of their adjacency matrices is bounded away from the highest 



TTiiV, p)/W. 



(5) 
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eigenvaluJ^. If the expander sequence is chosen well enough, the Cayley graph of the 
inductive limit G of the sequence [5] contains "almost isometric" copy of the union of Fj , 
and so G cannot be coarsely embedded into a Hilbert space, every action of G on a CAT(O) 
manifold has a global fixed point, etc. For more details see the book by Ollivier [172j and 
I. Kapovich and Schupp |133j . 

3.1.1. Almost all 1-related groups are residually finite. Note that for 1-related groups all 
three models give the same results. A random 1-relator group with at least 2 generators is 
infinite and hyperbolic. The following theorem says that these groups have some other nice 
properties as well at least when the number of generators is > 3. 

Theorem 3.18 (Sapir, Spakulova |211j ). A random 1-related group with at least 3 genera- 
tors is 

(1) residually finite; 

(2) virtually residually (finite p-)arow^^ for all hut finitely many primes p (i.e. it has 
a finite index subgroup whose homomorphisms onto finite p-groups separate all ele- 
ments); 

(3) coherent (i.e. all finitely generated subgroups are finitely presented). 

The proof employs facts from different areas of mathematics: properties of Brownian mo- 
tions in M'' [M]; some dynamical properties of polynomial maps over finite fields (existence 
of periodic quasi- fixed points) [25] and p-adic completions of number fields [26]; existence 
of dense free subgroups in p-adic Lie groups [32]; the congruence extension property of 
subgroups of the free groups generated by sets of words satisfying some small cancelation 
condition jl80] : the fact that every asceding HNN-extension of a free group is coherent 
|81j . and a characterization of 1-related groups with 2 generators that are ascending HNN- 
extensions of free groups 01]. It turned out also that in Theorem 13.181 one can count not 
just presentations but groups up to isomorphism (which is obviously harder) due to a very 
strong result of I. Kappovich, Shpilrain, Schupp |134j . See |209j for more details. 

Note that Theorem 13.181 gives many examples of residually finite hyperbolic groups be- 
cause random 1-related groups are hyperbolic (and even have small cancelation presenta- 
tions). But we still do not know if all hyperbolic (and even small cancelation) groups are 
residually finite |98| I99j. The general belief is that the answer is "no", and there are many 
potential counterexamples (see, say, |130| ). but it is very hard to show that a particular 
hyperbolic group is not residually finite because hyperbolic groups typically have many 
quotients. 

3.1. J. Quotients of hyperbolic groups. One of the key properties of hyperbolic groups dis- 
covered by Gromov in |98j is that they behave similar to the free groups as far as the 
multitude of quotients is concerned. 

For example, Delzant and Olshanskii independently proved the following 

Theorem 3.19 (Delzant [67] . Olshanskii jl80] ). Every non-virtually cyclic hyperbolic group 
G is SQ-universal, that is every countable group embeds into a quotient of G. 

A version of the following very useful theorem was stated in |98j in a slightly wrong 
form. It was corrected and proved in full generality in |179j . We give a not the strongest 
formulation here. 

Equivalently, an expander sequence of graphs can be defined as a sequence of fc-regular graphs F; = 
(VijEi) such that for every subset M C Vi of vertices of Fi with \M\ < \Vi\/2, ^pg^ > A for some constant 
A > |144) . Here dM is tire set of edges with one vertex in AI and the other in V \ AI . 

^^A group is called residually (finite p-) group, p a prime, if the intersection of all its subgroups with 
indexes powers of p is trivial. 
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Theorem 3.20. (Olshanskii [179^ Theorem 2]). Let G be a hyperbohc group and Hi, . . . Hn 
be non-virtually cyclic subgroups of G which do not normalize non-trivial finite subgroups 
of G, M a finite subset of G. Then there exists a non-virtually cyclic hyperbolic quotient 
Gi of the group G such that the natural map G — >• Gi is surjective on each Hi, . . . , Hn and 
injective on M. 

It is easy to deduce from Theorem 13.201 that every noncyclic torsion- free hyperbolic group 
has a quotient which is a torsion-free Tarski monster (i.e. it is torsion-free and all proper 
subgroups are cyclic); every non-virtually cyclic hyperbolic group has an infinite torsion 
quotient; every noncyclic torsion- free hyperbolic group has an infinite quotient all of whose 
proper subgroups are finite (see [98]). These groups with "extreme" properties are obtained 
as inductive limits of hyperbolic groups and surjective homomorphisms 

G — )• Gi —)•... —T- Gn — )• . . . 

similar to l3.1.H3l 

It is also easy to deduce that every two non-virtually cyclic hyperbolic groups Hi, H2 
have a common non-virtually cyclic quotient |981 5. 5. A] (consider the free product G = 
Hi/ E{Hi) * H2/ E{H2) where E{H) is the maximal finite normal subgroup of H). It was 
observed by Osin, that it implies, by induction, that there exists a finitely generated non- 
virtually cyclic infinite group Goo that is a common quotient of all non-virtually cyclic 
hyperbolic groups. One can easily see that Goo has Kazhdan property (T), is generated by 
an element of order 2 and an element of order 3, and also by two elements of orders p and 
q for any two primes with pq > 6 (since Goo is a quotient of the free product TLjpTL * TLjqlL^. 
In particular Goo is not residually finite, and in fact does not have any non-trivial finite 
homomorphic image Gq (otherwise Go would contain an element of order p> | Go | which is 
impossible). Note that Goo is not unique, and depends, in particular, on how we enumerate 
all hyperbolic groups. Still since the set of all finite hyperbolic group presentations is 
recursively enumerable, we can assume, using |179j . that Goo has a recursive presentation. 

Theorem 13.201 also allows one to construct a finitely generated infinite torsion group 
(and even with all proper subgroups cyclic), see [98]. It does not allow to get a bound on 
the exponents of elements of that group. It was done in papers by Olshanskii |178j (odd 
exponents) and S.V. Ivanov and Olshanskii |130( Theorem A, Lemma 19] (even exponents) 
where the following remarkable theorem was proved. 

Theorem 3.21 (Olshanskii |178| . S.V. Ivanov, Olshanskii [130] ). For every non-cyclic tor- 
sion-free hyperbolic group G there exists an odd integer m = m{G) > and a integer 
n = n(G) > 0, such that the following hold, (a) The quotient groups G/G"^, GjG'^^ are 
infinite^, (b) The word and conjugacy problems are solvable in G/G"^,G/G'^" . 

That result was the strongest in the very important series of results on the bounded 
Burnside problem including the celebrated Novikov-Adian's solution for odd exponents 
|17H [T], and extremely involved S.V.Ivanov's solution for even exponents [128j . 

3.I.K. The small cancelation once more. The main ingredient in the proofs of Theorems 
I3.19|,[3l20l and the results in |102j is small cancelation over hyperbolic groups. The following 
coarse version of small cancelation from Olshanskii-Osin-Sapir |184] is close to the ones used 
in papers [67 1 [T79l [T80] . 

Definition 3.22. Let H he a group generated by a set X. Let 7?. be a symmetrized set of 
reduced words in X^^ (that is a set of words closed under taking cyclic shifts and inverses). 



^"^Recall that G*^ denotes the (normal) subgroup of G generated by aU fcth powers of elements of G; hence 
G/G^ has exponent dividing k. 
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For e > 0, a subword C/ of a word R £ TZ is called a e-piece if there exists a word R' £ TZ 
such that: 

(1) R = UV, R' = U'V, for some V, U', V; 

(2) U' = YUZ in H for some words Y, Z such that max{|y|, \Z\} < e; 

(3) YRY-^ ^ R' in the group H. 

Note that if U is an e-piece, then U' is an e-piece as well. 

Definition 3.23. Let e > 0, ^ € (0, 1), and p > 0. We say that a symmetrized set TZ 
of words over the alphabet X^^ satisfies the small cancelation condition C{e,fi,p) over a 
hyperbolic group H = {X) , if 

(Ci) All words from TZ label geodesies in the Cayley graph of H corresponding to X; 

(C2) \R\ > p for any R £ TZ; 

(C3) The length of any e-piece contained in any word R £TZ is smaller than p\R\. 

Figure [10] shows the difference between the usual small cancelation (see Definition 13. 6|) 
and the coarse version of it. The key feature of the coarse version of small cancelation is the 
coarse Greendlinger Lemma (see [1841 Lemma 4.6]). The role of that lemma is very similar 
to the role of the usual Greendlinger Lemma (Theorem 13. 7p - see [179^ 1180^ I184j for more 
details. 




Figure 10. The standard and coarse definitions of a piece. 



3.I.L. Combination theorems for hyperbolic groups. Another way to construct new hyper- 
bolic groups is to use various combination theorems. Suppose that a finitely generated 
group G acts by automorphisms on a simplicial tree T and there are no invariant proper 
subtrees for this action. From the Bass-Serre theory |218j . the group G is a multiple HNN- 
extension of the stabilizer of a vertex with associated subgroups isomorphic to stabilizers 
of edges if and only if the action is transitive on vertices; and G is an amalgamated product 
of stabilizers of two vertices with the stabilizer of an edge as an amalgamated subgroup 
if and only if the action has two orbits of vertices and two orbits of edges. In general, G 
can be composed from the stabilizers of vertices using HNN-extensions and amalgamated 
products. A statement saying that some property P holds for G provided it holds for sta- 
bilizers of vertices of the action of G on T (plus some condition on the stabilizers of edges) 
is called a combination theorem. For example, the statement "if all stabilizers of vertices 
are torsion-free, then G is torsion-free" is such a theorem. 

A very general combination theorem for hyperbolic groups is proved by Bestvina and 
Feighn \17\ I18j. There are several other versions proved in \92 \ 11361 1160| and other papers. 
Here we present the "greatest common divisor" of all these statements. It was proved in 
each of these papers, and reflects the nature of all combination theorems. 

Theorem 3.24. If G is a hyperbolic group, A and B isomorphic virtually cyclic sub- 
groups, (f)-. A ^ B is an isomorphism between these subgroups, then the HNN-extension 
H = HNN(G; A, B, cj)) is hyperbolic if and only if for every g £ G the set A n gBg^^ is 
finite. 
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3.I.M. The subquadratic isoperimetric inequality. The "if part" of Theorem 13.101 can be 
strengthened significantly: one can prove that even a quadratic isoperimetric inequahty 
with small enough coefficient of the square implies hyperbolicity |98j . 

Theorem 3.25. (Gromov [981 6.8], Olshanskii [177], Bowditch [29], Papasoglu [US]). There 
exists a universal constant v such that the following holds. Let G = {X | i?) be a group 
presentation, p = max(|r|,r G R). If the Dehn function / of G satisfies 

fin) < —n' 

for all sufficiently large n, then G is hyperbolic. 

One can actually estimate the universal constant v by analyzing the proof of Bowditch 
[28], for example. The estimate would be far from optimal, though. Stefan Wenger informed 
us that one can take any v < ^: it follows from his paper |234j . It is an interesting question 
to find exact value of v or at least a good approximation. Note that for the free Abelian 
group with the standard presentation {x,y \ [x,y] = 1), the Dehn function is f{n) = jQn^ 
(it essentially follows from l2.6TCi) and p = 4. So v cannot be bigger than 1. 

Bowditch [29] proved actually a much more general result about all geodesic metric spaces 
with area functions satisfying Conditions (Bi) and (-B2) from l2.7^ (the role of is played 
by the constant k from (-62))- Wenger 's results in [234j are also much more general and 
apply to arbitrary geodesic metric spaces as well. In fact he proves that the isoperimetric 
inequality of the Euclidean plane, 

Areaf/) < —l^ 

is optimal in the sense that every geodesic metric space with stronger isoperimetric inequal- 
ity (for some natural area function) is hyperbolic. The crucial tool in his proof is metric 
currents as in l2.7.C[ 

3.I.N. The Cartan-Hadamard type theorem. Cartan-Hadamard type theorems are "local- 
to-global" statements in group theory: we deduce information about the global structure 
of a group G by looking at one (sufficiently large) ball in the Cayley graph of 

Theorem 3.26. (See Gromov [Ml 6.8M], Bowditch [251 Theorem 8.1.2]). There are con- 
stants Ci, C2, and C3 with the following property. Let F be the Cayley graph of a finitely 
presented group where the length of every defining relation is at most d and every ball of 
radius G2d in T is Cic?- hyperbolic. Then T is Csd- hyperbolic. 

An analogous result for metric spaces is also true (see [98]). The proof in [98] is based 
on the fact that Theorem 13.251 remains true even if restrict the assumption to relatively 
short loops: if every relatively short loop in F has quadratic area with sufficiently small 
coefficient, then the group is hyperbolic. 

This theorem and its variations are very important in the theory of hyperbolic groups. 
A version of it from [lOlj is used in the paper by Gromov and Delzant [68] (see also Coulon 
[60]) to provide possibly the most conceptually easy proof of the theorem of Novikov-Adian 
[171j and Olshanskii [178j . Another version of Theorem 13.261 for CAT(O) spaces is proved in 
Bridson and Haefliger [37]. A yet another version was recently proved by Shalom and Tao 
[216j : it significantly strengthens the polynomial growth result of Gromov [96] (mentioned 



Recall that the original Cartan-Hadamard theorem states that the universal cover of a Riemannian 
maniford of non-negative sectional curvature (a local property) is diffeomorphic to a Euclidean space (a 
global property). 
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Theorem 3.27 (Shalom, Tao |216] ). For some constant C, the following holds for every 
finitely generated group G, and all d > 0. If there is some Rq > exp(exp(C(i'^)) for which 
the number of elements in a ball of radius Rq in a Cayley graph of G is bounded by Rq, then 
G has a finite index subgroup which is nilpotent (of nilpotency class < C^). 

Problem 3.28. It would be interesting to find a common cause of all these (and other) the- 
orems of Cartan-Hadamard type. Perhaps a recent model theoretic approach of Hrushovski 
will give such a common cause (see [1261 Section 7]). 

3.1.0. Lacunary hyperbolic groups. It was observed by M. Kapovich and Kleiner (see the 
Appendix to |184j ) that Theorem 13.261 almost immediately implies the following stronger 
version of Theorem 13. 2[ 

Theorem 3.29 (See |184] ). Let G be a finitely presented group with one asymptotic cone 
an M-^ree. Then G is hyperbolic (and hence all asymptotic cones of G are M-trees). 

Indeed, if one of the asymptotic cones Con'^(G, (di)) is an M-tree, then the ball in the 
Cayley graph of G or radius di is o((ii)-hyperbolic for w-almost all i, and it remains to apply 
Theorem 13.261 since — )• oo. 

Theorem 13.291 is far from true for infinitely presented groups. The first example of a 
finitely generated infinitely presented (hence non-hyperbolic) group was constructed by 
Thomas and Velickovic in |228j . Their group is given by an infinite small cancelation 
presentation such that the lengths of relators form a very lacunary (sparse) sequence of 
numbers. As an almost direct application of Greendlinger's Lemma (see Theorem 13. 7p . one 
can prove that any cone Con'^(G, {di)) is an M-tree if di is far from the length of relators, 
and is not an M-tree if di is the length of the relator number i. 

Accordingly, in |184j . we introduced a class of groups some of whose asymptotic cones 
are M-trees, and called these lacunary hyperbolic groups. It is easy to deduce from Theo- 
rem 13.261 that every lacunary hyperbolic groups is an inductive limit of hyperbolic groups 
and surjective homomorphisms. A converse statement is also true if one imposes certain 
restrictions on the "injectivity radia" of these surjective homomorphisms. Tarski monsters, 
infinite finitely generated torsion groups, etc. can be constructed as lacunary hyperbolic 
groups. Surprisingly, some amenable (see I5.4.A[1 non- virtually cyclic groups turned out to 
be lacunary hyperbolic as well. For more details and results see |184j . 

3.2. Nilpotent groups. 

3. 2. A. The asymptotic cones and the c + 1-problem. Let G be a nilpotent group of class c 
(that is the cth member of its lower central series, Gc, is the identity; G„, is defined by 
Go = G, Gn+i = [Gn, G], n = 0,1, . . .). Every finitely generated nilpotent group is finitely 
presented. Algorithmic properties of nilpotent groups have been studied extensively, and 
this class can be considered relatively "tame". Note that the word, conjugacy and even 
the isomorphism problem are solvable for nilpotent groups (Grunewald, Segal [1061 1107] ). 
Still some natural algorithmic problems are known to be undecidable in some nilpotent 
groups. For example, there exists a finitely generated nilpotent group with undecidable 
endomorphic reducibility problem: given two elements of a group find out if one of them 
is an endomorphic image of another (Romankov |206| ). In particular, for some finitely 
generated nilpotent group, the problem of solvability of systems of equations in that group 
is undecidable. Indeed, G = {xi, . . . , Xm \ ri = 1, . . . , rn = 1) , is a finitely presented group, 
then the endomorphic reducibility of u to f is equivalent to the solvability of the following 
system of equations with respect to unknowns wi, ...,Wm- 
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{ri{wi, . . .,Wm) = 1 
rn{wi, . . .,Wm) = 1 
u{wi, . . .,Wm) = V 

Among nilpotent groups, particularly nice ones are graded nilpotent groups (sometimes 
also called homogeneous). Recall that a nilpotent group is called graded if for every x € 
Gk \ Gk+i,y € Gi\ Gi+i, the commutator [x,y] belongs to Gk+i \ Gk+i+i or is equal to 1 
(in a non-graded nilpotent group, the commutator {x,y] can belong to any Gn, n > k + I). 
With every nilpotent group G, one can associate a graded nilpotent group G^^ as follows 
G^^ is generated by the union of all sets Gk/Gk+i subject to all possible relations of the 
form [xGk,yGi] = [x,y]Gk+i- 

The growth function of a nilpotent group is always equivalent to a polynomial and its 
degree can be computed explicitly using the following formula: 

Theorem 3.30 (Guivarc'h |115| . Bass [13j). IfG is a finitely generated nilpotent group, and 
75(n) its growth function for some generating set S, then there are constants A,B > such 
that An'^ < 75(71.) < Bn"^ for all n > 1, where d = d{G) = X^/j>i hr^, r^ is the torsion-free 
rank of the Ahelian group Gh/Gh^i. 

In fact by the famous result of Gromov [96], groups having nilpotent subgroups of finite 
index are precisely the finitely generated groups whose growth functions are bounded by 
polynomial. The paper |96j was the first paper where asymptotic cones of nilpotent groups 
were studied. It was proved later by Pansu |194j that all asymptotic cones of any nilpo- 
tent group are isometric to a graded Lie group canonically associated to G. Here is the 
description. Let tor(G) be the finite normal subgroup of G generated by all elements of 
finite order (the fact that tor(G) is finite can be found, say, in |116j ). The nilpotent group 
G = G/tor(G) is without torsion, hence it can be embedded, according to Malcev |153| . as a 
uniform (co-compact) lattice in a nilpotent Lie group. The corresponding graded nilpotent 
group G is also a Lie group. That group equipped with the so-called Carnot-Caratheodory 
metricF1(see. for example, |100j ) is isometric to all asymptotic cones of G. In particular, all 
asymptotic cones of a nilpotent group are simply connected, and by Theorem I2.27( every 
nilpotent group has polynomial isoperimetric function. That result was strengthened by 
Gersten j84j : he proved that G admits a polynomial isoperimetric inequality of degree 2^ , 
where h is the Hirsch length of G i.e. the sum of torsion-free ranks of all Abelian groups 
Gj/Gj+i. The degree was improved to 2 • 3^^ by Conner [58], and then to 2c by Hidber |120j . 
One can deduce the bound c+l + e for every e > by the following argument. Pittet |199j . 
proved that a lattice in a simply connected graded nilpotent Lie group of class c admits 
a polynomial coarse area function A^ of degree c-\-l. Hence by the result of Pansu |194j . 
every asymptotic cone of any nilpotent group of class c has isoperimetric function rf~^^. 
It remains to apply Theorem 12.291 of Papasoglu. It is not clear how to remove e from the 
estimate. The methods of Dru^u [73] are based on the fact that the asymptotic cones in 
her situation were buildings. The first proof of the polynomial bound of degree c + 1 was 
outlined by Gromov in j99| 5.A5]. A complete combinatorial proof was found in [89]. 

^'''in order to define that metric, consider the tangent space at every point x of G. It is isomorphic to 
the Lie algebra g = G/Gi © GxjGi © . . . (with the natural commutator product) corresponding to G, and 
we can identify with g in a smooth way with respect to x. The elements of the first summand, G/Gi, 
generate g. The corresponding directions in G are called horizontal. A horizontal path on G is any path 
that has horizontal direction at every point. Every two points of G are connected by a horizontal path, and 
the length of a shortest such path connecting two points is the Carnot-Caratheodory distance between these 
two points. 
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Theorem 3.31 (Gersten, Holt, Riley [89]). Every finitely generated nilpotent group of class 
c has isoperimetric function that is polynomial of degree c+ 1. 

The isoperimetric inequality n'^'^^ is the best possible bound for some nilpotent groups 
of class c. For example if G is a free nilpotent group of class c with at least 2 generators, 
then its Dehn function is polynomial of degree c + 1 (see [15] or [87] for the lower bound 
and [199] for the upper bound, see also [38] for other examples of nilpotent groups with 
maximal possible Dehn functions). Moreover, the following general theorem is proved in 

[ia[99]. 

Theorem 3.32 (Gromov [99], Baumslag, Miller, Short [E]). Let G be a nilpotent group of 
class c and let Gc-i be the (c — l)st term of its lower central series. If Gc-i contains an 
infinite cyclic subgroup; then the quotient group G/Gc-i is a nilpotent group of class c — 1 
and has the Dehn function n^. 

In particular, is the factor-group of the Heisenberg group by its derived subgroup 
which contains an infinite cyclic group. Hence we get another proof that the Dehn function 
of I? is n?. 

3.2.B. The Heisenberg groups. Nevertheless rf^^ is not the smallest isoperimetric function 
for many nilpotent groups: 

Theorem 3.33 (Allcock [3], Olshanskii-Sapir |185j ). The 2n+l- dimensional integral Heisen- 
berg groups admit quadratic isoperimetric functions for n > 1; these groups are all nilpotent 
of class 2. 

Definition 3.34. Recall that the 3-dimensional Heisenberg group has presentation Hj, = 
{x,y,c I [x,y] = c, [c, x] = [c, y] = 1). Its center is generated by c. The 2n + 1-dimensional 
Heisenberg group -ff2n+i is the nth central power of i?3. In general, the central power of a 
group G is the factor-group of the nth direct power G x . . . x G by the normal (moreover, 
central) subgroup consisting of all vectors (ci,...,c„) with ciC2 . . . c„ = 1, where all q 
belong to the center of G. 

The proof of Theorem 13.331 from [185] is relatively simple but it has some features in 
common with very complicated computations of Dehn functions (say, |11H 12381 1239] ). so 
we provide some details here. 

In order to describe the proof of Theorem 13.331 let n = 2, that is let us consider the 
group 

= {xi,x2,yi,y2,c \ [xi,x2] = [yi,y2] = c, [xi,yj] = 1, 
[c,Xi] = [c,yi] = l,i,j E {1,2}). 

Consider a word w in the generators xi, X2,yi,y2- Suppose that u; = 1 in H^. We need 
to show that applying the relations of we can reduce the word to 1 so that the number 
of steps is 0(|tt;p). First notice that we can use the commutativity relations [xj, yj] = 1 and 
rewrite w as WxWy where Wx is a word in xi,X2, Wy is a word in yi,y2- We need 0(|tt;p) 
steps to do that. Now the proof consists of several reductions. 

Reduction 1. Note that if u{xi,X2) is a product of commutators, then w{xi,X2) = 
w{yi^y2) in H^. We prove that using at most 0(|tip) steps we can transform u{xi,X2) to 
u{yi,y2)- Therefore we can restrict ourselves to words u{xi,X2) in two variables only (but 
we use the other two generators for auxiliary transformations). 

Reduction 2. Every word u{xi,X2) can be drawn on the (xi, 2;2)-planar square grid: xi 
labels horizontal edges, X2 labels vertical edges (similar to l2.1.C|) . A word n is a product of 
commutators if and only if it labels a loop. Then it is easy to see that in H^, u is equal to 

for some a called the symplectic area of the region spanned by the curve. In particular u 
equals 1 in if and only if its symplectic area is 0. 
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Reduction 3. Now let us define a symplectic area of any word u(xi, X2), not necessarily 
a product of commutators. Let ki, i = 1,2 be the sum of exponents of Xi in u. Then the 
word C{u) product of commutators. Then the symplectic area of u is, by 

definition, the symplectic area of C{u). 

Reduction 4. A rectangular word R{p, q) where p,q > are integers is, by definition 
[X]',j;2]- Its symplectic area is pq. Our goal is for every word u{xi,X2), reduce the word 
C{u) to a rectangular word of the same area in number of steps quadratic in \u\. If n = 1 
in H^, then the rectangular word is empty, and we are done. 







U2/ 


IMl^ — 





Figure 11. The division in halves trick 



Reduction 5. We are using the "division in halves" trick shown on Figure [TTJ The 
word u is represented as a product of two subwords ui,U2 with \ui\ ~ 1^^1/2. Note that 
the loop corresponding to the word C{u) on Figure [TT] is decomposed into three loops: the 
loops corresponding to C{ui),C{u2) and a rectangular word R with sides < \u\. Then 
in order to reduce C{u) to a rectangular word, we need to be able to reduce C{ui) to a 
rectangular word Ri, reduce C{u2) to a rectangular word R2, and then reduce the product 
of three rectangular words R1RR2 to a rectangular words. The first two tasks can be done 
arguing by induction on the length of the words. Hence it remains to consider products of 
rectangular words. That is done by cutting and pasting (using Reduction 1). 

Note that Reduction 1 is crucial in this proof. Here we use the fact that has two 
extra generators 2/1,2/2 which we use as a temporary storage. Say, in order to trans- 
form u{xi,X2)v{xi^X2) to f (xi, X2)ti(xi, X2) where u^v are products of commutators, we 
first transform u[xi,X2) into ^(2/1,2/2) then use commutativity of generators and transform 
u(xi, X2)'y(2/i, 2/2) to v{yi,y2)u{xi,X2) and then transform ^(2/1, 2/2) back into v{xi^X2). This 
transformation is used in many places in the proof. 

Note that H-^ has cubic Dehn function (proved in |78j). 

3.2.C. Other central products. The central powers of nilpotent groups of class 2 have very 
small Dehn functions. The following result is mentioned in our paper with A. Olshanskii 
|185j . Its proof is essentially the same as the proof for the Heisenberg group i^2n+i- Another 
proof is given in R. Young |237j . Young's approach is in a sense a combination of approaches 
from [3] and [185] . 

Theorem 3.35. ( [1851 Remark on page 927]). Let G be any finitely generated nilpotent 
group of class 2. Then 

(a) The central square of G admits isoperimetric function v? log n. 

(b) Some central power of G has quadratic isoperimetric function. 

We were unable to show that the central square of every nilpotent group of class 2 
has quadratic isoperimetric function. More concretely, let FN(2, 10) be the free nilpotent 
group of class 2 generated by ai , 61 , . . . , 05 , 65 (it is given by all the defining relations of the 
form [[x, y],z] = 1, where x,y, z G {ai, . . . , 05, 61, . . . , ^5}). Let Gio be the factor-group of 
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FN {2, 10) by the central cyclic subgroup generated by [ai, bi] . . . [05, 65]. Then we were not 
able to prove that the central square of do has quadratic Dehn function (this example and 
the question can be found in |237t Section 5]). 

3.2.D. Wenger's result. Recently S. Wenger |235] showed that in fact central squares of 
nilpotent groups of class 2 often do not admit quadratic isoperimetric inequality. In partic- 
ular, he proved 

Theorem 3.36 (Wenger |235j ). The Dehn function of the central square of Giq is strictly 
greater than quadratic. 

The idea of the proof is the following. Let H be the central square of Gio- Suppose that 
H has quadratic Dehn function. Then by Theorem l2.30l in the asymptotic cone of H, every 
Lipschitz loop 7 must bound an integral 2-current of area c|7p for some constant c. Wenger 
gets a contradiction by finding a loop that does not bound an integral 2-current at all. In 
the proof, he essentially uses Pansu's description of asymptotic cones of nilpotent groups 
as nilpotent Lie groups with a Carnot-Carateodory metric (see |194j and 13. 2. A|) . Assuming 
that his loop bounds an integral 2-current he approximates C by singular Lipschitz chains. 
He then uses the fact that every Lipschitz map from a disc to a Carnot-Caratheodory 
space is differentiable almost everywhere (Pansu |194] ). 

Therefore Wenger obtained the first example of a finitely generated nilpotent group whose 
Dehn function is not equivalent to a polynomial (compare with Guivarc'h-Bass' Theorem 
13.301 about the growth functions). 

Problem 3.37. Find a combinatorial proof of Theorem 13.361 Is the Dehn function of the 
central square of Gio equivalent to log n? 

3.2. E. Other nilpotent groups. Note also that R. Young |238j constructed nilpotent groups 
of arbitrary class with quadratic Dehn functions. 

3.3. Semihyperbolic groups. Semihyperbolic groups were defined by Gromov |98j and 
with more precision by Alonso and Bridson [1]. Let X be a metric space. Let ^ > 1, 5 > 
be two numbers. A discrete {A, B)-path in X is any [A, i?)-quasi-isometry / from an interval 
of natural numbers [0,Tj] into X. We shall extend / to the whole N by f{n) = f{Tf) for 
n> Tf. Suppose that for some A, B we can choose a discrete (^, i?)-path pa^h between any 
pair of points a, 6 G X in such a way that for every four points a, h, a' , h' and every f G N 
we have the combing condition: 

(6) dist(pa,fe(i)iPa',6'(0) ^ Ci max(dist(a, a'), dist(6, 6')) + C2 

for some constants Ci,C2. Then we say that X is semihyperbolic. It is easy to prove that 
every hyperbolic metric space is semihyperbolic (exercise: prove that using asymptotic 
cones) and that a metric space that is quasi-isometric to a semihyperbolic space is semi- 
hyperbolic space itself. A finitely generated group G is called semihyperbolic if its Cayley 
graph with respect to some (equivalently, any) finite generating set is semihyperbolic and 
Pga,gb = gPa,b for every g,a,b & G. In that case we can consider the language of words label- 
ing the paths p\^h for all b € G. If that language is regular, then the semihyperbolic group 
is called bi-automatic. If the language is regular, but we only assume that the combing 
property Q holds provided a = a', then the group is called automatic [78j . 

3. 3. A. CAT(0)-spaces. Let X be a geodesic metric space, ABC be a geodesic triangle in X 
with side lengths p, q^ r. We can construct the triangle A'B'C' in the Euclidean space M? 
with the same side lengths. If a,b are points on the sides AB, BC of the triangle ABC, 
let a', b' be the corresponding points on the sides A'B' and B'C of A'B'C. If for every 
choice of the triangle ABC and points o, b we have dist(a, b) < dist(a', b'), then X is called 
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a CAT(O) space [991 [37]. In every CAT(O) space there exists unique geodesic p^^.yconnecting 
any two points x, y. These geodesies satisfy the combing property ^ (see [37j). Hence every 
CAT(O) metric space is semihyperbohc. The class of CAT(O) spaces includes all hyperbolic 
spaces, complete simply connected Riemannian manifolds all of whose sectional curvatures 
are non-positive (that is, Hadamard manifolds), including symmetric spaces of non-compact 
type, Euclidean buildings (see [42]). 

Every group acting properly discretely and co-compactly on a CAT(0)-space is semihy- 
perbohc. This implies that many non-hyperbolic groups are semihyperbohc. For example: 

(1) All Coxeter groups [4] (the fact that these groups are automatic was proved in [40]). 

(2) Right angled Artin group^ and many other Artin groups (see [l9]) including the 
direct product of two free groups (see 12. 3. C|) . 

(3) The mapping class groupLJ of a surface with genus g and n punctures provided 
3(7 — 3 + n > 2 (the fact that these groups are automatic was proved by Mosher 
[165] , the bi-automaticity was proved by Hamenstadt [117j ). 

(4) All co-compact lattices in SL„(M) (and many other Lie groups) because the corre- 
sponding symmetric space SL„(]R)/SO„(M) is CAT(O). 

(5) Ah lattices in SO„,i(M) |37]. 

3.3.B. Semihyperbolic metric spaces have quadratic isoperimetric inequality. Indeed, con- 
sider every loop 7 of length / and point a on that loop. Subdivide the loop into a sequence 
of subarcs of length 6 (for some constant 6) by points 61, . . . , 6„,, and connect a with bi, . . . ,bn 
by the (A, i?)-quasi-geodesics Pa,bi- The length of each of these quasi-geodesics is at most 
0{l). Then by the combing property ([6]) for every t € N the distance between Pa,bii't) and 
Pa,bi+i{'t) is bounded by some constant C = C{6). Thus we obtain a collection of points 
Pa,bi{t), t G N, containing the points 61, . . . ,bn, having mesh 0(1). The number of points 
in that set is at most 0{P). Hence the Gromov coarse area function As{l) is bounded by 
0{P) for a sufficiently large 6. 

A very similar proof shows that automatic groups have quadratic isoperimetric functions 
too. 

3.3.C. The conjugacy problem in semihyperbolic groups. Another key property of semihy- 
perbolic group is the following 

Theorem 3.38 (Alonso-Bridson [3]). Every semihyperbolic group has solvable conjugacy 
problem. 

Proof. Let G be a semihyperbolic group, u,v G G. Suppose that x~^ux = v for some 
x G G, and the length \x\ of a; is the smallest possible. Clearly it is enough to bound |x| 
as some recursive function in \u\ + \v\. We show that one can take an exponential function 
exp(c(|ti| + \v\) for some constant c. Indeed, let x = aia2 ■ ■ ■ am be a product of generators 
and their inverses. Note that in the Cayley graph of G we have two paths pi,P2 labeled by 
X, pi connecting 1 and x, and p2 connecting u and ux. Since xv = ux, the initial points 
of pi and p2 are at distance \u\ from each other and the terminal points are at distance 
\v\ from each other. By the combing property ([6]), then there exist paths Qi, i = 1, . . . ,m 
connecting ai . . .ai with uai . . . Oj of length < C{\u\ + |f |) for some constant G. If m is 
larger than the number of different words in the generators of G of length G{\u\ + |f|) 
(which is exp(c(|u| + \v\) for some constant c), then two of these paths, say qi,qi^j have 

^^A right angled Artin group is a group given by a presentation of the form {ai,...,a„ \ aiaj = 
ajUi for some i, j £ {1, . . . , n}) 

^^The mapping class group of a surface S is the group of all isotopy classes of orientation preserving 
self-homeomorphisms of 5*. This group is finitely presented for every S. 
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the same labels. Xhen let y — oi . . . OjOj-i-j-i-i . . . flm. It is easy to see that y uy — v which 
contradicts the minimality of x. □ 

Geometrically this proof can be described as follows. Cut a mimimal annular diagram 
A by a simple path p connecting the boundary components. Using the combing property 
([6|) , for every vertex of p we can find a (non-null-homotopic) loop in A based at that point 
and having length at most C times the sum of lengths of the boundary components. We 
can assume that these loops do not cross each other. If p is too long, two of these loops 
will have the same label. Cutting off the annular subdiagram bounded by these two loops 
from A, we obtain another, smaller, annular van Kampen diagram with the same boundary 
labels (see also the proof of Lemma 14.81 below) . 



4. Dehn functions and algorithmic properties of groups 

4.1. The zoo of groups with quadratic Dehn functions. We have already mentioned 
that automatic and semihyperbolic groups have quadratic isoperimetric functions. In fact 
the set of finitely presented groups with quadratic isoperimetric functions is quite diverse. 

Theorem 4.1. The following groups have quadratic Dehn functions: 

(1) Drut-u's examples of ^o\ycyc\\^^non-nilpotent groups from [73] (see also [99l Section 
5.A.9];, 

(2) Cornulier-Tessera examples of non-polyciclic metabelian groups with quadratic Dehn 
functions (some of these groups contain the Baumslag-Solitar group BS(1,2)J [59]. 

(3) The Heisenherg groups of rank > 5, sufficiently large central power of any (infinite) 
finitely generated nilpotent group of class 2, some finitely generated nilpotent groups 
of arbitrary nilpotency class (Alcock [3], Olshanskii-Sapir jl85j . Young \237\ 1238] ). 

(4) All extensions of finitely generated free groups by cyclic groups (Bridson, Groves 

m)- 

(5) The Stallings group |224| (the kernel of the homomorphism from the direct product 
of three free groups of rank 2 to Wj that sends all generators to the generator of Z) 
(Dison, Elder, Riley, Young [71] 

(6) The groups SL{n,Z), n > 5 (Young [239] j. 

(7) The R. Thompson group F (Cuba [TTT] )R. 

The proofs of the quadratic isoperimetric inequality for each of these classes of groups is 
quite complicated and quite different. 

4.1. A. . Drutu's groups from Part (1) of the theorem are uniform lattices in solvable Lie 
groups of class 2. She proves quadratic isoperimetric inequality for these Lie groups us- 
ing asymptotic cones (these are horoballs in direct products of Euclidean buildings) and 
Theorem [2291 

4.I.B. . Nilpotent groups from part (2) have been discussed already in Section [321 



20a group G is called polycychc if it has a series of subnormal subgroups G = Go ^ Gi > . . . > Gn ~ {1} 
with cyclic factors Gi/Gi+i. 

91 

Note that Stallings' group was the first example of a finitely presented group which is not of type FP3. 
Thus by [71] and a previous results of Riley [203j and Papasoglu [196j this group is the first example of a 
finitely presented group with simply connected but not 2-connected asymptotic cones. 

^^The R.Thompson group F consists of all piecewise-linear increasing bijective functions [0, 1] — [0, 1] 
whose derivatives have finitely many dyadic break points and the slopes of linear pieces are powers of 2. The 
operation: composition of functions. That remarkable group has a presentation with two generators and 
two defining relations [46j . 
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4. I.e. . The proof for free- by-cyclic groups is very long and compicated. Here is a short 
description of that proof sent to us by Daniel Groves. 

Let G = x</, Z be a free-by-cyclic group, with natural HNN-presentation 

V = {Xi, . . . ,Xn,t \ tXit~^ = (j){xi)). 

Let A be a minimal diagram over this presentation. Consider the maximal t-bands in A. By 
Lemma [2.23l every maximal t-band starts and ends on the boundary of A. Every HNN-cell 
has a top side (corresponding to a letter Xi) and a bottom side (corresponding to (j){xi)). 
Passing from the bottom to the top corresponds to applying the automorphism (p. It is 
easy to see that a pair of HNN-cells cannot be glued along their bottoms in a minimal 
A, so each 2-cell can be traced backwards through A via the edge on the bottom sides of 
HNN-cells until it hits an edge on the boundary of A. If such an edge e G (9A is fixed, then 
all the 2-cells corresponding to e form a single color (and there are fewer than |9A| colors). 
The proof involves a detailed investigation of the two transverse structures coming from 
bands and colors. The evolution of a color and of different colors is closely related to the 
dynamics of words in Fn under iterates of 4>, although the interaction of bands complicate 
this substantially. 

4.I.D. . The proof of quadratic isoperimetric inequality of the Stallings group is purely 
combinatorial. The authors in [7T] present an algorithm of creating a van Kampen diagram 
with quadratic area and a given boundary label. 

4. I.E. . The problem of whether the Dehn function of S'Ln(Z) is quadratic for large enough 
n was open for quite some time. The group SL2(Z) is hyperbolic (in fact virtually free), 
so its Dehn function is linear. It was proved in [78] that SL^iZ) has exponential Dehn 
function. The fact that the Dehn function of SLn(Z) is at most exponential for every n was 
stated in Gromov [991 5.A7] and fully proved by Leuzinger |142j . The book [78] contains a 
statement attributed to Thurston that for n > 4 the Dehn function is quadratic. Recently 
Robert Young proved in |239j that the Dehn function is quadratic for n > 5. The case 
n = 4 is still open. 

To explain Young's proof, we need some preliminaries. Note that S-L„(M) is quasi- 
isometric to the symmetric space SL„(M)/SOn(-R), and the word metric in SL„(Z) is quasi- 
isometric to the restriction of the Riemannian metric on SL„(M) to SL„(Z) by Lubotzky- 
Mozes-Raghunathan |145j . The action of SL„(Z) on the symmetric space SL„(M)/SO„(i?) 
is not co-compact. The typical case is when n = 2. In that case the symmetric space 
SL2(M)/S02(M) is just the Hyperbolic plane H^, say the upper half of the complex plane 
with the hyperbolic metric. The group SL2(K) acts on by the Mobius transformations: 



Then the action SL2(Z) on has a fundamental domain which is an ideal triangle (with 
one vertex at infinity) . Consider the set Hoo of all z E with imaginary part at least 1 (the 
number 1 can be replaced by any real number > 0). It is a horoball in H^. Note that H^o is 
stable under the action of the subgroup of SL2(M) consisting of upper uni-uppertriangular 



group SL2(Z) consists of horoballs too. The union of these horoballs is the thin part of the 
symmetric space H^. The complement of the thin part is the thick part. The thick part is 
clearly invariant under the action of SL2(Z) and the action is co-compact. A similar picture 
exists if n > 3. The symmetric space SL„(M)/SOn(]R) has a horoball Hqq. The union of 
the horoballs SL„(Z) • i^oo is called the thin part of the symmetric space, its complement is 





The orbit of H^o under the action of the 
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the thick part, and the action of SL„(Z) on the thick part is co-compact. Thus it is natural 
to consider "equivalent objects": the group SL„(Z) and the thick part of the corresponding 
symmetric space. 

These preliminaries should be enough to understand a short description of the proof from 
[239] sent to us by Robert Young. Every word w that is equal to 1 in SL„(Z) corresponds 
to a loop in the symmetric space SL„(M)/SOn(M). The basic idea of the proof is to use 
fillings of loops in the symmetric space SL(p; R)/SO(p) (which has quadratic area function 
being semihyperbolic) as templates for fillings loops in the Cayley graph of SL„(Z). Fillings 
which lie in the thick part of SL„(M)/SOn(ffi) correspond directly to fillings in SL„(Z), but 
in general, an optimal filling of a curve which is contained in the thick part may have to 
go deep into the thin part. The proof uses both geometric and algebraic points of view. 
Different horoballs in the thin part correspond to different parabolic subgroups of SL„(Z) 
(each of these subgroups is conjugate to some group of block upper triangular matrices), 
so one key step was to develop geometric techniques to cut the quadratic filling of a loop 
in the symmetric space into pieces (subdiscs) with each piece lying in one horoball. The 
word corresponding to the boundary of each piece is a product of three words, each of 
which represents an element of the parabolic subgroup (these elements are determined by 
the geometry of the filling). Each of these three words can be written as a product of 
a block diagonal matrix and a unitriangular matrix. The uni-triangular matrix can be 
written as a product of boundedly many elementary matrices (one non-zero off-diagonal 
entry) where the off-diagonal entry may be of exponential size. The elementary matrices 
can be replaced by relatively short (linear length) words in SL„(Z) (shortcuts) using the 
technique from Lubotzky-Mozes-Raghunathan |145j . For this purpose Young includes the 
elementary matrices in some solvable subgroups with quadratic Dehn functions where the 
cyclic subgroups generated by these elements are exponentially distorted. This is not very 
mysterious. We have already seen (see I2.6.C|) that the cyclic group generated by a is 
exponentially distorted in BS(1, 2). The group BS(1, 2) is metabelian and has the following 



in this representation. Of course as we have seen (Example I2.22p . the Baumslag-Solitar 
group has exponential Dehn function. But Dru^u [73], and later Cornulier and Tessera |59j 
have shown that metabelian groups of higher ranks can have quadratic Dehn functions. 
The key property of Young's solvable subgroups is the freedom of choosing them. This is 
similar to having two commuting copies of the Heisenberg group in H^, in the proof from 
I3.2.BI and this is why Young's proof does not work in SL4 (Z) : there is simply not enough 
room to place several commuting solvable subgroups with quadratic Dehn functions. The 
shortcuts allow Young to reduce the problem of filling the original loop to the filling problem 
for loops in some subgroups of block-diagonal matrices (which are just direct products of 
matrix groups of smaller sizes). Repeatedly applying these geometric and combinatorial 
techniques breaks the original word into pieces which lie in smaller and smaller subgroups 
of SL„(Z) and ultimately to a quadratic filling. 

4. I.E. . The result that the R.Thompson group F has quadratic Dehn function jlllj was 
quite unexpected. It was conjectured by Gersten [88] that the Dehn function of F is 
exponential. It was proved to be at most exp(log^ n) in our paper with Guba |112j . For 
some time we thought that exp(log^ n) is equivalent to the Dehn function of F because we 
had a concrete sequence of loops in the Cayley graph of F for which we could not find fillings 
with smaller areas. A big breakthrough was achieved by Guba in [109j where a polynomial 
isoperimetric function was found. Still it was hard to believe that the Dehn function is 
quadratic because the polynomial fillings used in [109] seemed optimal. 
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Recall that the R. Thompson group F has infinite presentation 
(7) {xQ,xi, ... I x^^XjXi = Xj+iMi < j). 

It is also generated by xo,xi subject to 2 defining relations. It turned out that instead of 
dealing with a finite presentation of F one is better off working with the infinite presentation 
obtained from ([7|) by considering only those relations where j — i < 5. It is not difficult to 
prove that this (infinite) presentation defines F and the quadratic isoperimetric inequality 
for this presentation is equivalent to a quadratic inequality for a finite presentation of F. 
(In defining an isoperimetric inequality for the infinite presentation, one needs to modify 
the notion of length of a word; Guba uses a notion of complexity of a word which is the 
length plus the difference between the highest and the lowest indices of letters occurring in 
the word.) Cuba's proof is in a sense similar to the proof from |185j outlined in l3.2.Bl He 
also introduces normal forms of elements in F (in the case of these were the rectangular 
words R{p, q)), and considers the process of reducing a word to its normal form rather than 
reducing a word that is equal to 1 in to the empty word. Using a trick similar to the 
"division in halves trick" from l3.2TBt Guba reduces the problem to finding the filling area 
of words of the form uv where u, v are normal forms. These are called the triangular words 
in [lllj . He then transforms these words to words of different shapes (rectangular, vertical, 
horizontal). In order to do that Guba uses various elaborate cutting and pasting techniques. 
Some of these amount to dividing a word into three parts which are of approximately (but 
not quite) equal lengths. Instead of three pieces as on Figure [TTl Guba gets 8 pieces, and 
the following recurrent formula for the isoperimetric function /(n): 

/(n) <8/(n/3)+0(n2). 

The fact that the coefficient of /(n/3) in that formula is smaller than 9 = 3^ plays crucial 
role. It turns out that the reduction procedure described in [lllj not always decreases the 
complexity of a word. In order to make increases rare enough, one needs to cut the words 
into three parts of not quite equal lengths and the required freedom is allowed by the fact 
that 8 < 9. 

4.1. G. Rips' question. Note that the following problem formulated by Rips in 1994 is still 
open. 

Problem 4.2 (Rips). Is F automatic? 

It is known |112] that F admits a regular language of (unique) normal forms. But that 
set of normal forms does not satisfy the combing property ([6]). 

Also note that in |110j . Guba proved that the simple R.Thompson groups T and V 
[46) have polynomial isoperimetric functions (these groups are finitely presented by [121| ). 
Exact Dehn functions of these groups are unknown. Simple finitely presented groups with 
quadratic Dehn functions (and even bi-automatic) are known (see Burger and Mozes [44^ 

m)- 

Problem 4.3. Is F semihyperbolic? 

4.2. The conjugacy problem for groups v^ith quadratic Dehn functions. Semihy- 
perbolic groups and all the groups mentioned in Theorem 14.11 have one thing in common: 
the conjugacy problem in any of these groups is solvable. For semihyperbolic groups the 
proof is easy (see Theorem I3.38P . The proofs for the different classes of groups mentioned 
in Theorem 14. II are quite different, and sometimes very complicated. 

4. 2. A. . For polycyclic (in particular, nilpotent) groups, solvability of the conjugacy prob- 
lem was proved by Formanek [82] using representation of these groups by matrices. 
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4.2.B. . For finitely presented metabelian groups, the solvability of conjugacy problem was 
proved by Noskov |170j using the Magnus representation of metabelian groups by matrices 
over commutative rings, and some very non-trivial commutative algebra (finding groups of 
units of commutative rings, in particular). 

4.2.C. . For free-by-cyclic groups it was proved by Bogopolski, Martino, Maslakova and 
Ventura [23] using some deep properties of the dynamics of free group automorphisms, in 
particular, Bestvina-Feighn-Handel train tracks [19j . 

4.2.D. . For the Stallings group, the solvability of conjugacy problem is proved by Bridson 
in |34] . He applies the following nice result (we weaken the formulation a little bit in order 
not to introduce new terminology). 

Theorem 4.4 (Bridson [31]). Let G be a bi-automatic group and let N <\ G he a normal 
subgroup. If the membership problem for finitely generated subgroups of G/N is solvable, 
then N has a solvable conjugacy problem. 

Recall that the input of the membership problem for finitely generated subgroups of 
a group G is a tuple of elements of G (given as words in generators) ui, . . . ,Un,v. The 
output is "yes" if v belongs to the subgroup generated by ui, . . . ,Un. 

4.2.E. . For the group SL(n,Z), the solvability of conjugacy problem was proved indepen- 
dently by Grunewald |105j and Sarkisyan [212j . The diversity of methods used in these 
papers is reflected by the following paragraph taken from Roger Lyndon's review of [105| 
in Math Reviews: "The arguments, given explicitly, are long and intricate but appear for 
the most part to use hardly more than standard ideas from the theory of linear groups and 
their modules, from number theory, and, from the theory of groups, coset representations 
and the Reidemeister-Schreier theorem." 

4.2.F. . The conjugacy problem in the R.Thompson group F was first solved by Guba and 
myself in [113j using the diagram group representation of F. More recently a different 
solution was found by Belk and Matucci [16| . 

4.2.G. Rips ' problem. These results justify the following general problem which was formu- 
lated in 1994 as a conjecture (before most of the above results confirming this conjecture 
appeared): 

Problem 4.5 (E. Rips). Is the conjugacy problem solvable in every finitely presented group 
with quadratic Dehn function? 

The fact that this problem is difficult is shown by the diversity of methods used in proving 
solvability of the conjugacy problem mentioned above, and the following "quasi-proof from 
our paper with A. Olshanskii |191j . 

In order to formulate the statement, we shall need the well known constructive version 
of a limit of a sequence of numbers. In fact we are going to use that definition only in the 
case when the limit is 0. 

Definition 4.6. Let g: N — )■ M be a function. We say that the constructive limit of g{n) 
as 77, —7- oo is if for every integer A > there exists = N{A) such that for every 
n > N, 1/(77)1 < 1/A, and the function N(A) is recursive. In that case we shall write 
lim^_j.o^ g{n) = 0. It is easy to see that lim^_j.;^ g{n) = if and only if there exists an 
increasing recursive function f{n) such that g{k) < for every k > f{n). 

Quasi-Theorem 4.7. Let d{n) be the Dehn function of a finite group presentation P. 
Suppose that lim'f^^^ ^^"^ = 0. Then P has decidable conjugacy problem. 
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Proof. We shall need the following lemma. 

Lemma 4.8. Let A be a minimal annular diagram with boundary components p,p' over a 
finite group presentation P. Let x be a shortest path connecting p and p' . Then the area of 
A is at least C\x\ log \x\ for some constant C depending on P. 

Proof. Consider the following construction. Let po = P (considered as a cyclic path) be 
the inner contour of the diagram A. Suppose that we have constructed a cyclic path pi 
surrounding the hole of the diagram in A such that pi does not have common vertices with 
p'. Let Ki be the annulus bounded by po and pi. Let Mj+i be the set of cells of A outside 
Ki that have common vertices with pi. Then let -fCj+i be the minimal annular subdiagram 
of A with simple contours that contains Ki and all cells from Mj+i. Let Pi+i be the outer 
contour of Ki (the inner contour oi Ki is p = po). 

It follows that every edge of the path pi-\-i belongs to the contour of one of the cells of 
Mj_|_i. Hence every vertex of pj+i can be connected with a vertex of pi by a path such that 

(0) the length of the path is bounded by a constant, 

(1) it can be connected with pQ by a path of length at most 0{i) and 

(2) the number of cells in Mj+i is at least 0(|pj+i|). 

From (1), it follows that the number of subdiagrams Ki is 0(|a;|). Furthermore, more than 
a half of the paths pi have length at most log^ |x| where c is, say, four times the number of 
letters in the alphabet of the presentation P. Indeed, otherwise we would have two paths pi 
and Pj, i ^ j with the same labels, and we could remove the annular subdiagram between pi 
and pj reducing the area of A (that would contradict the minimality of A), see an analogous 
trick in the proof of Theorem 13.381 above. 

From (2), it follows that at least half of the subsets Mi contain at least 0(log|a;|) cells 
each. Since these sets do not intersect, the number of cells in A is at least 0{\x\ log \x\). □ 

Now the "quasi-proof of the Quasi-Theorem [4.71 proceeds as follows. Suppose that P is 
a finite presentation with undecidable conjugacy problem. Suppose that the constructive 
limit of „^[og „ is 0. Then, in particular, d{n) is bounded from above by a recursive function, 
and P has solvable word problem. 

Note that if an annular diagram A with boundary labels u and v has a simple path x 
with label t connecting the boundary components, then we can cut A along x and obtain a 
disc van Kampen diagram with boundary label t^^ut^^v~^ . So if \t\ is recursively bounded 
in terms of and \v\ (for every u and v that are conjugate modulo P) then the conjugacy 
of u and v can be algorithmically verified. 

Pick an increasing recursive function f{n) with ^^a'^^g ^ < ^ for every k > f{n) where C 
is the constant from Lemma 14.81 (as in Definition 14. 6p . Since the conjugacy problem for P 
is undecidable, there exists a minimal annular diagram A with contours p, p' such that any 
path in A connecting p and p' has length at least f{\p\ + \p'\)- Let n = |p| + \p'\. Let x be 
a shortest path connecting p and p'. Thus 



Since x is a shortest path connecting p and p' , x is simple. Let us cut A along x and 
obtain a disc diagram P with boundary label zuz~^v~^ where z is the label of x^^. By 
Lemma 14.81 the area of P is at least C|rE| log \ x\. 



(8) 



x\ > f{n) 



and so 



(9) 
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Now we can take an integer m between M — 1 and — . We attach m copies of T con- 
secutively to each other along the sides labeled by z to get a van Kampen diagram H with 
boundary label zu^z~^v~^'' . The perimeter r of 11 is between 2\x\ and 3|2;|, and the area is 
m times the area of A. So, by Lemma |4.8| the area of 11 is at least ^il£Li2iJ£i _ gy -^^g 
can deduce that the area of IT is bigger than d{r). This contradicts the definition of Dehn 
function of a group presentation. □ 

Remark 4.9. The only gap in the preceding argument is contained in the last phrase. We 
cannot guarantee that 11 has minimal area among all diagrams with the same boundary 
label, and, in principal, the area of a minimal diagram with this boundary label may be 
even quadratic in terms of the perimeter r. Still we do not know any groups for which this 
proof does not work. Note that we do not need 11 to be minimal: only that the minimal 
diagram with the same boundary label does not have too few cells compared to 11. Also 
note that 11 cannot be "obviously" non-minimal. For example, it is easy to see that 11 
cannot contain two cells that share an edge and are mirror images of each other (if such a 
pair of cells existed, we would replace their union by a diagram with no cells, reducing the 
number of cells). Also we have freedom of choosing n, u, A and x. We do not need x to be 
a minimal length path connecting the boundary components of A. We only need that the 
area of A exceeds 0{\x\ log |x|) divided by a recursive function in n (depending only on the 
presentation). In addition, the number m should only be 0{ |^|^|^,| )■ Thus Quasi-theorem 
14.71 seems true for a very large class of groups and possibly for all finitely presented groups. 

4.2. H. The sharp hound of n^logn. It is shown in |191j that Quasi-theorem \A7i\ is true for 
groups that are multiple HNN-extensions of free groups (for the definition see 12. 6. Bp . 

Theorem 4.10 (See |191j ). Let d{n) he the Dehn function of a multiple HNN-extension of 
a free group. Suppose that Ivca!^^^ n^iog n ~ ^^^'^ ^ decidahle conjugacy prohlem. 

The bound logn in Theorem 14.101 turned out to be sharp. 

Theorem 4.11 (See jl91j ). There exists a finitely presented group that 

• is a multiple HNN-extension of a free group, 

• has undecidahle conjugacy prohlem, 

• has Dehn function n^logn. 

It is pointed out in |191j and [36] that Theorem 14.101 and the results of [36] give an 
alternative prove of solvability of the conjugacy problem in free-by-cyclic groups (originally 
proved in [23]). 

4.2.1. The conjugacy prohlem in automatic groups. It is still unknown whether the con- 
jugacy problem is decidable in every automatic group although it is decidable in every 
bi-automatic group (by Theorem 13.381 since these groups are semihyperbolic) . 

Problem 4.12. Can the quasi-proof above and, specifically. Remark 14.91 be applied to 
prove that automatic groups have decidable conjugacy problem? Is it possible to apply the 
quasi-proof to groups from Theorem 14.11 in order to obtain a unified proof of the solvability 
of conjugacy problem for these groups? 

4.3. A description of isoperimetric functions > n^. It is most probable that the class 
of Dehn functions > n'^ log n is as wide as the class of time functions of Turing machines 
with the same restriction. The next theorem confirms that conjecture in the case of Dehn 
functions > n^. 
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4. 3. A. Superadditivity. We call a function / superadditive if for all natural numbers m, n 
we have /(m + n) > f{m) + f{n). The problem of whether all Dehn functions of finitely 
presented groups are superadditive (up to equivalence) is still open. We conjectured in |210j 
that the answer is "yes" . In |114j Guba and myself proved that every free product G * H 
where G and H are non-trivial finitely presented groups, has a superadditive Dehn function. 
For example, for every group G the free product G * Z has a superadditive Dehn function 
(here Z is the infinite cyclic group). In fact the Dehn function of G* Z is the superadditive 
closure of the Dehn function of G, i.e. the smallest superadditive function exceeding the 
Dehn function of G. Thus if there was an example of a finitely presented group G with 
non-superadditive Dehn function, it would be also an example of a group G whose Dehn 
function is not equivalent to the Dehn function of G * Z. Note that a presentation of G * Z 
is obtained from the presentation of G by adding one new generator and no extra relations. 
The problem of superadditivity of all Dehn functions is similar in nature to the problem 
of whether the quasi-proof from 14.2] works for all groups with quadratic Dehn functions. A 
counterexample is hard to imagine, let alone construct. 

4.3. B. The theorem. 

Theorem 4.13. (See Sapir-Birget-Rips [2101 Theorem 1.2]). 1. Let P4 be the set of 
all Dehn functions d{n) > of finitely presented groups. Let 71 be the set of time 
functions t(n) > of arbitrary (non-deterministic) Turing machines. Let 7"^ be the set of 
superadditive functions which are fourth powers of time functions. Then 

2. For every non-deterministic Turing machine T with time function T{n) such that T'^{n) 
is superadditive there exists a finitely presented group G with Dehn function T^{n) and the 
isodiametric function T^{n) such that the problem recognized by T reduces polynomially 
to the word problem in G (see 12. 2. G]) . 

Note that we do not know if 

• Every time function of a non-deterministic Turing machine is equivalent to a super- 
additive function. 

• The square root of a time function of a non-deterministic Turing machine is equiv- 
alent to the time function of a non-deterministic Turing machine. 

If one or both of these statements were true, it would be (obviously) possible to simplify 
the formulation of Theorem 14. 131 

4.4. NP-complete and coNP-complete groups. 

4. 4. A. NP-complete groups. Now let us take a non-deterministic Turing machine M recog- 
nizing some NP-complete problem, say, the exact bin packing Problem 12. 8i Let G be the 
group from Theorem 14.131 that corresponds to M. Then G has polynomial Dehn function, 
hence its word problem is in NP (by I2.2.jp . On the other hand since the problem recog- 
nized by M polynomially reduces to the word problem in G, the word problem in G is 
NP-complete. Hence we get 

Corollary 4.14. (See [2101 Corollary 1.1]). There exists a finitely presented group with 
NP-complete word problem. 
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4.4. B. CoNP- complete groups. Theorem 14.131 only treats the "yes" part of the word prob- 
lem. Van Kampen diagrams are not witnesses of the "no" part (i.e. inequalities w ^ 1). 
Nevertheless we have the following 

Theorem 4.15 (Birget |21j). There exists a finitely presented group G whose word problem 
is coNP- complete. 

Birget reduces a known coNP-complete problem, namely the circuit equivalence prohlerr^ 
to the word problem in a concrete finitely presented subgroup of the Thompson-Higman 
group V [46] (denoted by G2,i in Higman |121j ). 

That group, as all Thompson groups, consists of (partial) transformations of the set of 
infinite binary words. One of the simple key ideas is that if G is a group of transformations 
of a set X, then a witness to the inequality f ^ g for f,g £ G is just an element x € X 
such that f{x) / g{x). We have mentioned already in I2.2.GI that the group V has been 
used before in constructing finitely presented groups with undecidable word problem by 
McKenzie-Thompson |155| . They used V to simulate a construction of arbitrary recursive 
functions from simple building blocks (somewhat similar to boolean circuits). 

4.5. The isoperimetric spectrum. The isoperimetric spectrum consists of all numbers 
a > 1 such that is equivalent to the Dehn function of a finitely presented group. By 
Theorem 13.251 the intersection of the isoperimetric spectrum with the open interval (1,2) 
is empty. The next theorem gives an almost complete description of all numbers in the 
isoperimetric spectrum that are > 4. 

Theorem 4.16 (Sapir |210j ). The isoperimetric spectrum contains all numbers a > 4 
whose n-th binary approximatioi^^ can be computed by a deterministic Turing machine in 
time less than 2^ . On the other hand, Theorem \2. 1 ?| implies that if a is in the isoperimetric 
spectrum then the n-th binary approximation of a can be computed in time < 2^^ . 

The difference in the number of 2's in these expressions, is the difference between P and 
NP in Computer Science (if P = NP then there should be two 2's in both expressions). 

All "constructible" numbers (rational numbers, algebraic numbers, values of relatively 
easy computable analytic functions at rational points such as e + 2, vr + 1, 10 sin |, etc.) 
which are > 4 satisfy the condition of Theorem 14.161 In fact, these and most other familiar 
numbers can be computed by very fast quasilinear algorithms (see J. Borwein and P. Borwein 

EZl). 

The intersection of the isoperimetric spectrum with the interval [2, 4] is less known, 
although it is known that it is dense, contains all rational and many transcendental num- 
bers from that interval [331 [30l [3T] . The advantage of the snowflake construction and its 
modifications employed in these papers is that one obtains explicit and often quite short 
presentations of groups with prescribed Dehn functions of the form n° while the groups 
from |210j are obtained by simulating Turing machines computing binary approximations 
of a (and so the number of generators and relators are usually quite large and writing 
explicit presentations is not a feasible task). 



23a boolean circuit [232) is a "device" built of elementary circuits connected by wires; each elementary 
circuit represents a basic operation from logic: AND, OR, NOT and FORK. Every circuit has input wires and 
output wires. Every time we send signals or 1 through input wires, we get a combination of signals or 1 
coming through the output wires. Hence every circuit represents a boolean function. The circuit equivalence 
problem asks, given two (acychc) boolean circuits, do they have the same input-output function? 

2^That is a rational diadic number /3 — a/2" which is within 0(2~") from a 
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4.6. Groups with polynomial-non-recursive and quadratic-non-recursive Dehn 
functions. Let F^^^^ and -^i;^ig}^ be two classes of increasing functions N — )■ N. We say that 
a function h: N — t- N is an Fj^^— non— Fijjgj^-function, if for some / G ^Iqw ^(^) — /(^) 
infinitely many n, and for every g E -^high' ^("') > ^l'^) ^oi infinitely many n. For example, 
a function h is polynomial-non-recursive if h is not bounded above by any recursive function, 
but is smaller than some fixed polynomial f{n) for infinitely many values of n. Similarly 
we can define the class of functions which are quadratic-non-recursive, etc. 

4. 6. A. Polynomial-non-recursive. 

Theorem 4.17 (Olshanskii, Sapir [182j ). There exists a finitely presented group with poly- 
nomial-non-recursive Dehn function. 

Proof. We shall sketch a proof showing how to deduce Theorem 14 . 1 71 from the results of |210j . 
It shows a general strategy of estimating Dehn functions of groups from |210j corresponding 
to the so called S-machines (see Section [6]). Let M be the Turing machine from Theorem 
12.61 tbat is M has undecidable halting problem and infinitely many /i-good numbers where 
h{n) = expexpn. In |210] . it is shown how for every Turing machine M, one can construct a 
group G = G{M). The fact that M has undecidable halting problem, and |210t Propositions 
4.1, 12.1] imply that G has undecidable word problem. The presentation of G from |210j 
shows that G is obtained from some special multiple HNN-extension of a free group (called 
an S-machine) with free letters 6i, . . . ,6^ by imposing one relation which is called the hub 
(see ESI). 

To estimate the Dehn function of G, we need to estimate the areas of van Kampen 
diagrams over the presentation of G from |210j . Let A be a van Kampen diagram over that 
presentation with boundary label w of length I. We assume that A is minimal. 

By |210l Proposition 11.1] we can do the so called snowman decomposition of A, to obtain 
three sequences of diagrams Ei,. . . , Eg, Li,..., Ts~i, Hi,..., n<i_i, with the following 
properties: 

(51) Es = A. 

(52) El contains no hubs, i.e. Ei is a diagram over G'. 

(53) For every i = l,...,s — 1, we have that Ei^i is a union of subdiagrams Fj, E'j, Ilj 
having no common cells, and Ilj is either empty or contains exactly one hub cells 
and no 6'j-edges on its boundary. 

This implies that we can glue together the diagrams Ei,Ti, . . . , Ts-i, Hi, ... , II^-i so as 
to obtain a diagram with boundary label w. Therefore in order to find the upper bound for 
the area of A we need to estimate the sum of the areas of Fi, . . . , Fs_i, Hi, . . . , II^-i. 

By [2101 Lemma 11.21] for every i = 1, . . . , s — 1, 

\diE,)\ < \d{E,+i)\. 

Therefore 

\d{Ei)\ < \w\, i = l,...,s, 

and s < 

The diagram Ei and for every i the diagram Fj contain no hubs. So by [2101 Lemma 
8.1], 

Area(F,) <ci|9(Fi)|^ 

and 

Area(Fi) < ci\d{Ei)\^ 
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for some constant ci. We also know that < \w\ and by [210^ Lemma 11.21] |5(rj)| < 

C2|9(£'i+i)| < C2|w|, for z = 1, . . . , s — 1 and some constant C2. Therefore 

Area(£;i) + Area(ri) + . . . + Area(rs_i) < 

ci|9(i?i)|3 + ci|a(ri)|3 + ci|a(r2)|2 + ... 
+ci|a(r,_i)|3 < 

CiC2s|w|^ < C3|lf;|^. 

for some constant C3. 

It remains to estimate the area of Ilj . Let 11 be one of the Ilj, i = l,...,s — 1. Let 
W be the word written on the boundary of 11. Note that \W\ < C5I for some constant C5 
because the union of Ei,Ti,Ili is -Ej+i by property (53) and the estimates of the length 
\diEi)\,\d{Ti)\ above. 

We can assume that 11 is a minimal diagram. Then by |210|, Propositions 4.1] some 
subword w of the boundary label of H is an accepted configuration of M and the area 
of n is bounded from above by CQtime{w)'^ for some constant cq. Moreover time{w) is 
either bounded by cjIwI or does not exceed C8time(w;') for some input configuration w' with 
\w'\ < \w\ and some constants cyjCg. Now suppose t^- < I < — for some /i-good number 
6^1. Then \w'\ < \w\ < b. Hence time{w') < log 6 < /. Thus the area of H is bounded 
by cgP for some constant cg. Thus the area of A is bounded by some polynomial in I. 
Since the set of /i-good numbers of M is infinite by the choice of M, the group G has 
polynomial-non-recursive Dehn function. □ 

4.6.B. Quadratic-non-recursive. The following theorem which is much stronger than The- 
orem [57T7] was proved in |182j . 

Theorem 4.18 (Olshanskii |182j ). There exists a finitely presented group with quadratic- 
non-recursive Dehn function. 

Comparing with I4.6.AI the proof of Theorem 14.181 requires significantly new ideas. It is 
easy to see that the proof from I4.6.AI does not give quadratic upper bound for infinitely 
many values of n (for instance, the S'-machines usually have cubic Dehn functions |21Uj . 
so one cannot hope to bound the area of even Ei by a quadratic function). The proof of 
Theorem 14.181 involves 

• A new construction of an S'-machine simulating the Turing machine from Theorem 

[2:61 

• A new way of decomposing a diagram over the presentation of the S-machine plus 
the hub. Several parameters of diagrams and several cutting and pasting operations 
on diagrams are introduced. One of the parameters is the area, another one is the 
perimeter, and there are several others. Every cutting operation reduces certain 
expression involving the parameters which makes the induction possible. 

5. HiGMAN EMBEDDING 

5.1. A characterization of groups with w^ord problem in NP and other time 
complexity classes. The fundamental result of Higman |122j gave an algebraic charac- 
terization of groups with recursively enumerable word problem (i.e. recursively enumerable 
set of words that are equal to 1 in the group). 

Theorem 5.1 (Higman jl22| ). The word problem in a finitely generated group G is recur- 
sively enumerable if and only if G is a subgroup of a finitely presented group. 

There are several different proofs of Theorem 15. 1[ See, for example, Rotman's book 
|207j and Manin's book |157| (where you can also find an interesting discussion of the 
philosophical significance of that theorem). See also 16.41 below. 
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After |122j . there were several results showing that embedding into a finitely presented 
group can preserve or even improve the algorithmic properties of the group. First results 
have been obtained by Clapham [H] and Valiev [231j (see |189j for the history of these 
results): they proved that the solvability (even r.e. degree) of the word problem and the 
level in the polynomial hierarchy of the word problem is preserved under some versions of 
Higman embedding. 

By Theorem 12.181 the Dehn function of a finitely presented group is recursive if and 
only if the group has decidable word problem. Moreover by Proposition 12.101 for every 
finitely presented group G with Dehn function T{n) there exists a nondeterministic Turing 
machine M{G) which solves the word problem in G and has time function equivalent to 
T{n). Roughly speaking, this Turing machine takes a word w over the generators of G and 
just inserts relators of G. It stops and accepts w when it gets 1. Clearly this machine solves 
the word problem in every finitely generated subgroup of G as well. Therefore if a finitely 
generated group G is a subgroup of a finitely presented group with polynomial isoperimetric 
function then the word problem in G is in NP (i.e. it can be solved by a non-deterministic 
Turing machine with polynomial time function). 

The drawback is that the word problem in a group G can be easy to solve but the Dehn 
function of G can be huge. A typical example is the Baumslag-Solitar group BS(1,2) = 
{a,h I hab-^ = a^). This group has exponential Dehn function (see Example 12. 22p . But we 
have mentioned in 14.1. El that this group is a subgroup of GL(2, Q) so the word problem in 
BS(1, 2) can be solved in at most quadratic time: it is easy to see that the word problem of 
every finitely generated group of matrices over a field of rational numbers can be solved in 
at most quadratic time by a deterministic Turing machine. In fact it is possible to solve the 
word problem there in time n(log n)^(log log n) (this follows from the fact that the product 
of two n-digit numbers can be computed in time n log n log log n |141j ). 

Nevertheless, the following theorem shows that every finitely generated group G with 
word problem in NP can be embedded into a finitely presented group H with polynomial 
isoperimetric function. Thus if we can solve the word problem in G using a very clever and 
fast Turing machine, then we can use the simple-minded but almost as fast Turing machine 
M{H) to solve the word problem in G. 

Theorem 5.2 (Birget, Olshanskii, Rips, Sapir [22]). Let G he a finitely generated group with 
word problem solvable by a non- deterministic Turing machine with time function < T(n) 
such thatT{n)'^ is superadditive (that is T{m + n)'^ > r(m)^ + T(n)^ for every m,n). Then 
G can be embedded into a finitely presented group H with isoperimetric function equivalent 
to n'^T[n'^)^ . In particular, the word problem of a finitely generated group is in NP if and 
only if this group is a subgroup of a finitely presented group with polynomial isoperimetric 
function. 

For matrix groups over rationals. Theorem 15.21 implies that every such group is embedded 
into a finitely presented group with Dehn function at most n^^^'^ for every e > 0. Young's 
result from |239j (see I4.1.EP implies that for matrix groups over Z one can reduce the 
exponent 10 + e to 2. 

Problem 5.3. Is it true that every finitely generated matrix group over a field (in particular, 
the field of rational numbers) embeds into a finitely presented group with quadratic Dehn 
function? 

Theorem 15.21 immediately implies 

Corollary 5.4. The word problem of a finitely generated group G is in NP if and only if 
G embeds into a finitely presented group with polynomial isoperimetric function. 
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By Fagin's Theorem 12.91 this corollary gives an algebraic characterization of groups where 
the word problem admits an algebraic description. 

Using Corollary 15. 41 (and the proof of Theorem 15 . 2p . in order to embed a finitely generated 
group with word problem in NP into a finitely presented group with polynomial isoperimetric 
function, one needs first construct a Turing machine which solves the word problem, then 
convert it into an 5-machine, then convert the S'-machine into a group. As a result the group 
we construct will have a relatively complicated set of relations. In some particular cases like 
the free Burnside groups Bm,n, the free solvable groups, etc., it is possible to modify this 
construction and get simple presentations of groups with polynomial isoperimetric functions 
where these groups embed [186] . For the Baumslag-Solitar group BS(1,2) it was done in 
|186] too, and an even much simpler presentation was found by Cornulier and Tessera [59] 
(see also Theorem 14. H part (2) ). 

Theorem 15.21 is also interesting from the logic point of view. One can consider a group 
as a logical system where the defining relations are axioms, and the inference rules are 
constructing step by step van Kampen diagrams. The van Kampen diagrams are then 
proofs of their boundary labels. Then the Dehn function becomes the syntactic complexity 
of proofs. The computational complexity of the word problem is the complexity in the 
metaworld of the group theory. The embedding of groups becomes a conservative extention 
of theories. 

With this vocabulary, the result means that the complexity of proofs in the outer world of 
groups (that is, in its metamathematical semantics) after appropriate conservative extension 
to a finitely axiomatized theory becomes the complexity of proofs in the syntactical sense 
(up to a polynomial correction). 

In this formulation, one can ask a similar question for any logical system. Several results 
of this type for general logical systems can be found in [139] . 

By Theorem 12 . 271 the class of groups with polynomial isoperimetric functions is a subclass 
of the class of groups with simply connected asymptotic cones. Hence we formulate 

Problem 5.5. Does every finitely generated group with word problem in NP embed into 
a group all of whose asymptotic cones are simply connected? Equivalently (by Theorem 
15. 2p : does every finitely presented group with polynomial isoperimetric function embed into 
a group all of whose asymptotic cones are simply connected? 

Note that by Theorem 12. 27| groups with simply connected asymptotic cones have linear 
isodiametric functions. On the other hand, for every k > A there are groups with Dehn 
function and isodiametric function by [210] . Thus if the embedding as in Problem 
15.51 exists, it must distort lengths at least polynomially. The embeddings in [22] distort 
lengths linearly (see [22]), so a quite different construction should be used. 

5.2. The space functions of Turing machines and the filling functions of groups. 

The results about space functions of Turing machines and the FFFL functions of groups 
[183] are similar to the results about time functions and isoperimetric functions obtained 
in [2101 122] . On the one hand the situation is simpler because it is easier to control the 
space function than the time function of a machine. Also the issues with superadditivity 
does not appear in this situation. Indeed, if we want to check if two words are equal to 1 
using a Turing machine, then the time spent will be the sum of times needed to checking 
each word (hence we need superadditivity of the Dehn function) but the storage space is 
approximately the maximal of the spaces needed for each word because after we proved 
that one of these words is 1, we can clean the tape and start proving that the other word is 
equal to 1 (so superadditivity is not needed). On the other hand, the descriptions are more 
finalized which requires more detailed consideration of geometry of van Kampen diagrams. 
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If in [2101 122j . for example, we just cut the diagram into pieces and estimate the number 
and the areas of the pieces, here one needs to carefuhy examine the lengths of the cuts. Also 
working with time and Dehn functions, we could modify Turing and 5-machines by adding 
artificial tapes which can potentially by very long, but serve as a "temporary storage space" . 
Dealing with space and FFFL functions, we cannot do that which adds to the complexity 
of the situation. 

Nevertheless the following complete description of FFFL functions of groups was obtained 
by Olshanskii. 

Theorem 5.6 (Olshanskii |183j ). Every space function f{n) of a Turing machine is equiv- 
alent (as in \2.2.D]) to the FFFL function of some finitely presented group. 

Since space functions of Turing machines can be "arbitrarily complicated" (similar to 
time functions), we obtain finitely presented groups with "arbitrary complicated" FFFL 
functions. 

Also similar to the isoperimetric spectrum one can define the space spectrum of finitely 
presented groups as the set of all a such that n" is equivalent to the FFFL function of a 
finitely presented group. Since the problem with non-determinism vs determinism is not an 
issue, we have a complete description of the space spectrum. 

Theorem 5.7 (Olshanskii jl83j . compare with Theorem I4.16p . For a real number a > 1, 
the function n" is equivalent to the FFFL function of a finitely presented group if and only 
if the nth binary approximation of a is computable by a deterministic Turing machine using 
space < 2 . 

An analog of Theorem 15.21 also was obtained in [183| . Note that by Proposition 12.20] the 
space complexity of the word problem in a finitely presented group G does not exceed the 
FFFL function of G. It follows from jl50j . |53j that the converse statement fails. An easier 
example is given by the Baumslag 1-relator group from jl4j G = {a,b \ {aba~^)b{aba~^)~^ = 
6^). Its FFFL function is not bounded from above by any multi-exponential function (see 
Gersten [86] and Platonov |200| ) while the space complexity of the word problem for G is 
polynomial (see A. G. Miasnikov, A. Ushakov, and Dong Wook Won jl67j ). 

Theorem 5.8 (Olshanskii |183j ). Let G be a finitely generated group such that the word 
problem for G is decidable by a deterministic Turing machine with space function f{n). 
Then G embeds into a finitely presented group with FFFL function equivalent to fin). 

Similarly to Corollarv 15.41 one immediately deduces 

Corollary 5.9 (Olshanskii |183j ). The word problem of a finitely generated group G belongs 
to PSPAGE if and only if G embeds into a finitely presented group with polynomial FFFL 
function. 

5.3. Collins' problem. The conjugacy problem turned out to be much harder to preserve 
under embeddings. We have already mentioned the results of Collins and Miller j55j and 
Gorjaga and Kirkinskii[93]: even subgroups of index 2 of finitely presented groups do not 
inherit solvability or unsolvability of the conjugacy problem (see 12. 4. B]) . 

In 1976 D. Collins |138j posed the following question (problem 5.22): Does there exist a 
version of the Higman embedding theorem in which solvability of the conjugacy problem is 
preserved? The solution is given by the following two theorems. 

Theorem 5.10 (Olshanskii, Sapir |189j ). A finitely generated group H has solvable con- 
jugacy problem if and only if it is Frattini embedded into a finitely presented group G with 
solvable conjugacy problem. 
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Theorem 5.11 (Olshanskii, Sapir |190j ). Every countable recursively presented group with 
solvable word and power problems is embeddable into a finitely presented group with solvable 
conjugacy and power problem. 

Recall that a subgroup H of a group G is Frattini embedded in G if every two elements 
of H that are conjugate in G are also conjugate inside H. It is clear that if H is Frattini 
embedded into G and G has solvable conjugacy problem, then H also has solvable conjugacy 
problem. We say that G has solvable power problem if there exists an algorithm which, given 
u,v in G says if f = for some n 7^ 0. 

Theorem 15.111 is a relatively easy application of Theorem 15.101 

In the same Problem 5.22 of |138j Collins asked whether a version of Higman embedding 
can preserve the recursively enumerable hierarchy (the recursive sets are at the bottom of 
that hierarchy). The affirmative answer is also given in |189j . 

The construction in [189j is much more complicated than in [22] or |183j . First we embed 
H into a finitely presented group Hi preserving the solvability of the word problem (say, 
one can use [22]). Then we use the Miller 5-machine M{Hi) (see I6.1.A1) to solve the word 
problem in H. In order to overcome technical difficulties, we needed certain parts of words 
appearing the computation to be always positivj^. The standard positivity checkers do 
not work because they are (S-machines as well, and can insert negative letters. So we 
used some ideas from the original Boone-Novikov proofs. That required introducing new 
generators, x-letters (in addition to the a-, q-, and ^-letters in 5'-machines) and Baumslag- 
Solitar relations (as in Example I2.22p . In addition, to analyze the conjugacy problem in 
G, we had to consider annular diagrams which are more complicated than van Kampen 
disc diagrams. Different types of annular diagrams (spirals, roles, etc.) required different 
treatment. 

We do not have any reduction of the complexity of the conjugacy problem in H to the 
complexity of the conjugacy problem in G. In particular, solving the conjugacy problem in 
G, in some cases required solving systems of equations in free groups (i.e. the Makanin- 
Razborov algorithm, see I2.3.E|) . 

5.4. A finitely presented non-amenable group without free subgroups. One of 

the most important applications of our versions of Higman embeddings so far was the 
construction of a finitely presented counterexample to the von Neumann problem, i.e. a 
finitely presented non-amenable group without non-Abelian free subgroups [188] . 

5. 4. A. Short history of the problem. Suppose that a group G acts on a metric space X by 
isometrics. We say that the action is paradoxical if X can be decomposed as a disjoint 
union Ai U . . . U U i?i U . . . U i?„ (m, n > 1) such that for some gi, . . . , gm, /ii , . . . , /i„ of 
G we have X = giAi L) . . . L) gm^m = /ii-Bi U . . . U h^Bn. Hausdorff |118j proved that the 
action of (5*0(3) on the unit sphere minus a countable set of points is paradoxical: one can 
subdivide the 2-sphere minus a countable set of points into 3 parts A, B, C, such that the 
union of any two of these parts can be obtained by rotating the third part (hence we can 
cut a sphere into a finite number of pieces, rotate these pieces and obtain two spheres of 
the same radius). Banach and Tarski [12] generalized Hausdorff 's result and proved what 
is now known as Banach- Tarski paradox. Von Neumann [233] noticed that the cause of the 
paradoxes is the structure of the group G. In particular the cause of the Hausdorff paradox 
is that S03(M) contains a non-Abelian free subgroup. He called the groups that cannot act 
paradoxically amenable. Von Neumann showed that the class of amenable groups contains 
Abelian groups, finite groups and is closed under taking subgroups, extensions, and infinite 
unions of increasing sequences of groups. Day [6^ and Specht |223j showed that this class is 



'A word is called positive if it does not contain inverses of the alphabet letters. 
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closed under homomorphic images. The class of groups without non-Abelian free subgroups 
is also closed under these operations and contains Abelian and finite groups. 

The problem of existence of non-amenable groups without non-Abelian free subgroups 
probably goes back to von Neumann and became known as the "von Neumann problem" 
in the fifties. 

First counterexamples to the von Neumann problem were constructed by Olshanskii 
[173j . He proved that the Tarski monsters, both torsion- free and torsion (see |175j ). are not 
amenable. Later Adian [2] showed that the non-cyclic free Burnside group of odd exponent 
n > 665 with at least two generators, that is the group given by the presentation 

(fli, . . . , I li" = 1, for all group words u in the alphabet {ai, . . . , am}), 

is not amenable. Recently Liick and Osin |146j constructed the first example of residually 
finite non- amenable torsion group. 

All these examples are not finitely presented. For the Tarski monsters and Burnside 
groups this is because these groups are not hyperbolic but are inductive limits of hyperbolic 
groups (see l3.1.Jl) . a similar argument applies to the groups of Liick and Osin. The question 
about existence of finitely presented counterexample to von Neumann's problem was ex- 
plicitly formulated by Grigorchuk in |138| and by J. Cohen in [5l]. There exists one finitely 
presented group without non-Abelian free subgroups for which the problem of amenabil- 
ity is non-trivial: it is the R.Thompson's group F (for the definition of F look in Section 
I2.1.AP . The question of whether F is not amenable was formulated by R. Thompson in the 
beginning of the 70s (unpublished), and, in print, by R. Geoghegan in 1979. A considerable 
amount of work has been done to answer this question but it is still open. 

5.4.B. The result. Together with A. Olshanskii, we proved the following theorem. 

Theorem 5.12 (Olshanskii, Sapir |188j ). For every sufficiently large odd n, there exists a 
finitely presented group Q which satisfies the following conditions. 

(1) Q is an HNN-extension of a finitely generated infinite group of exponent n. 

(2) Q is an extension of a group of exponent n by an infinite cyclic group. 

(3) G contains a subgroup isomorphic to the free Burnside group of exponent n with 2 
generators. 

(4) G is a non-amenable finitely presented group without free non-cyclic subgroups. 

By a theorem of Adian f2], part Q implies that Q is not amenable. Thus parts ([TJ and 
Q imply part (g]). 

5.4.C. The proof. Let us present the main ideas of our construction. We first embed the free 
Burnside group B{m, n) = {B) of odd exponent n » 1 with m > 1 generators . . . , bm} = 
B into a finitely presented group Q' = {C \ TZ) where B d C. This is done as in [22l I187j 
using an explicitly constructed S'-machine. Then we take a copy A = {oi, . . . , a^} of the 
set B, and a new generator t, and consider the group given by generators C U ^ U {t} and 
the following three sets of relations. 

(1) the set TZ of the relations of the finitely presented group Q'; 

(2) (n-relations) y = Uy, where Uy,y £ C, is a certain word in A satisfying the small 
cancelation condition C"(A) for a very small A; 

(3) (HNN-relations) t~^ait = bi,i = 1, . . . ,m; these relations make {A) a conjugate of 
its subgroup of exponent n (of course, the group {A) gets factorized). 

The resulting group Q is obviously generated by the set v4U {t} and is an HNN-extension of 
its subgroup {A) with the stable letter t. Every element in {A) is a conjugate of an element 
of {B), so {A) is an m-generated group of exponent n. This immediately implies that Q 
is an extension of a group of exponent n (the union of increasing sequence of subgroups 
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t^{A)t~^, s = 1, 2, . . .) by a cyclic group. It turns down that (A) contains a copy of the free 
Burnside group B{2,n). Thus the group is non-amenable by Adian [2] (see |188j and |208j 
for details). 



6. Methods: S'-machines and related constructions 

Several theorems discussed in this survey have been proved using S-machines, a natural 
blend of Turing machines and groups (see Theorems 14. IH I4.13| 14.161 14.171 14.181 15.21 15.61 
15.81 15.1(11 15.12|) . Here we shall present a short introductions to S'-machines. 



6.1. The definition. There are several ways to look at S'-machines introduced in |210j . 
One can view them as a version of Turing machines (see I2.2.C1) . as rewriting systems (as 
in j210] ). as groups that are certain multiple HNN-extensions of free groups (see |191j ) 
or as semigroups of partial l:l-transformations of the set of admissible words as in |188j . 
For different applications, one needs different points of view. Probably the easiest way to 
introduce S'-machines is by defining them as multiple HNN-extension of a free group. 



6.1. A. The Miller machine. Let us start with an example that we call the Miller machine. 
This important example is due to C. Miller [161] , Let G = {X | i?) be a finitely presented 
group. The Miller machine is the group M{G) generated by X U {q} U {Ox \ x G X} U {9^ \ 
r G R} subject to the following relations 

6x = x6, OxXq = qxOx, O^q = qrOr 

where 9 is any letter in = {Ox \ x G X} U {9^ \ r G R}. Clearly, this is an HNN-extension 
of the free group {X,q) with free letters 6 £ Q. The main feature of M{G) discovered by 
Miller is that M{G) has undecidable conjugacy problem provided G has undecidable word 
problem. In fact it is easy to see that qw is conjugated to q in M{G) if and only \i w = 1 
in G. 

To see that M{G) can be viewed as a version of a Turing machine, consider any word 
uqv where u.,v are words in X U . If we conjugate uqv by Or, we get the word uqrv 
because Orq = qrOr and Or commutes with u and v (here and below we do not distinguish 
words that are freely equal). Hence conjugation by Or amounts to executing a command 
[q —7- qr]. Similarly, conjugation by Ox amounts to executing a command [q — t- x^^qx]. If 
u ends with x, then executing this command means moving q one letter to the left. Thus 
conjugating words of the form uqv by 0's and their inverses, we can move the "head" q to 
the left and to the right, and insert relations from R. 

The work of the Miller machine M{G) can be drawn in the form of a van Kampen diagram 
(see Figure \T2\i that we call a trapezium. It consists of horizontal 0-bands. The bottom side 
of the boundary of the trapezium is labeled by the first word in the computation (uqv), the 
top side is labeled by the last word in the computation (q), the left and the right sides are 
labeled by the history of computation, the sequence of 0's and their inverses corresponding 
to the commands used in the computation uqv — )■ ... — )• q. The words written on the top 
and bottom sizes of the 0-bands are the intermediate words in the computation. We shall 
always assume that they are freely reduced. 
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Figure 12. Trapezium of the Miller machine for a deduction uqv —)•...—)• 5. 
Here u = y . . .x^ v = . . . z. 



Viewed as a Turing machine, the Miller machine has one tape and one state letter. 

6.I.B. The formal definition. Like Turing machine (see 12.2. C]) general S -machines can have 
many tapes and many state letters. Here is a formal definition. 

Let F{Q,Y) be the free group generated by two sets of letters Q = u'^LiQi and Y = 
U^^S^^Yi where Qi are disjoint and non-empty (below we always assume that Qn+i = Qi, 
and Yn = Yo = 0). 

The set Q is called the set of g-letters, the set Y is called the set of a-letters. 

In order to define an HNN-extension, we consider also a collection Q of A^-tuples of 9- 
letters. Elements of G are called rules. The components of 9 are called brothers 9i, . . . , 9js[. 
We always assume that all brothers are different. We set ^at+i = 9i. 

With every 9 € Q, we associate two sequences of elements in F{Q U Y): B{9) = 
[Ui,...,Un], T{9) = [Vi,...,Vn], and a subset Y{9) = yjYi{9) of Y, where Yi{9) C Yi. 

The words f/j,!^ satisfy the following restriction: 
(*) For every i = 1, . . . , A^, the words Ui and Vi have the form 

Ui = Vi-ikiUi, Vi = v[_ik[u'i 

where fcj, k[ G Qi, ui and u[ are words in the alphabet Y^^ , Vi-i and v^_-^ are words 
in the alphabet Y^^\. 

Now we are ready to define an 5-machine S by generators and relations. The generating 
set X of the 5-machine S consists of all q-, a- and ^-letters. The relations are: 

Ui9i+i = 9iVi, i = l,... ,s, Oja = a9j 

for all a £ Yj{9). 

Every S-rule 9 = [Ui Vi, . . . ,Us ^ Vs] has an inverse 9~^ = [Vi Ui, . . . ,Vs — )• Us]; 
we set Yi{9-^) = Yi{9). 

Remark 6.1. Every S'-machine is indeed an HNN-extension of the free group F{Y, Q) with 
finitely generated associated subgroups. The free letters are 9i for every 9 £ Q. We leave 
it as an exercise to find the associated subgroups. 

6. I.e. Turing machines as S -machines. Every Turing machine T (see the definition in 
I2.2.CP can be considered as an S'-machine S'{T) in the natural way: the generators of 
the free group are all tape letters and all state letters. The commands of the Turing 
machine are interpreted as rules of the S'-machine. The main problem in that conversion 
is the following: there is a much bigger freedom in applying S-rules than in executing the 
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corresponding commands of the Turing machine. Indeed, a Turing machine is in general 
not symmetric (i.e. if [U V] is a, command of the Turing machine then [V — )■ U] is 
usually not) while every S'-machine is symmetric. Another - more important - difference is 
that Turing machines work only with positive words, and 5-machines work with arbitrary 
group words. This, for example, means that the rules [aqb — )• cqd] and [q — )■ a~^cqdb~^] are 
equivalent (the corresponding group presentations are equivalent). Hence an S'-machine is 
blind: it decides which rule to apply based solely on the state letters of the configuration, it 
does not "see" the content of the tape (unlike the Turing machine which takes into account 
the tape letters observed by the head). Hence the language accepted by S'{T) is usually 
much bigger than the language accepted by T. 

Nevertheless, it can be proved that if T is symmetric, and a computation wi ^ W2 ^ ■ ■ ■ 
of the (S-machine S'{T) involves only positive words, then that is a computation of T. 

This leads to the following idea of converting any Turing machine T to an S'-machine 
S{T). First we construct a symmetric 'Turing machine T' that is equivalent to T (recognizes 
the same language). That is a fairly standard Computer Science trick (see |210j ). 

The second step is to compose the S-machine S'(T') with a machine that checks positivity 
of a word. That machine starts working after every step of S'{T'). That is if an application 
of a rule of S'(T') gives a non-positive (reduced) word, then the checking machine does not 
allow the machine S'{T') to proceed to the next step. 

There are several checking machines. One of them - the adding machine - is very simple 
but its time function is exponential (see |191| ). Another one is very complicated but it has 
a quadratic time function (it was first constructed in j210j and used in [2101 [22] ) . 

6.2. The conjugacy problem. If a Turing machine M has undecidable halting problem, 
then the S-machine S{M') (where M' is a symmetric Turing machine that is equivalent 
to M) has undecidable conjugacy problem |210] . It is easy to see that the Dehn function 
of any S-machine is at most cubic. The adding machine used for checking positivity of 
admissible words exponentially slows down the machine. A nice consequence is that the 
Dehn function of the resulting 5-machine is log n [191j which is the minimal possible for 
multiple HNN-extensions of free groups having undecidable conjugacy problem by 14. 2. Hi 
Note that although the upper bound of n? log n is easy to anticipate, a proof of it involves 
very non-trivial combinatorics of van Kampen diagrams (some Vassiliev-type invariants of 
chord diagrams are used, see |191j ). 

6.3. The word problem. To obtain a group with undecidable word problem and, more 
generally, with a prescribed Dehn function, we do the following. Take an S'-machine 5, with 
many tapes. In order to make the number of tapes large enough, just glue the copies of 
the same S-machine side-by-side so that the trapezia of the new S-machine are glued from 
copies of trapezia of the old S-machine as in I4.2.G[ Let h be the accept word of the S- 
machine (all tapes are empty, all state letters are in the accept state). Let the presentation 
of the group G{S) be obtained from the presentation of S by adding one huh relation h = 1. 
Now if W is an admissible word accepted by 5, we can take the corresponding trapezium 
A, identify the left and right sides of it (labeled by the history of computation) to obtain 
an annulus with W on the outside boundary component and h on the inside boundary 
component, then glue the cell corresponding to the hub relation to the inside boundary 
component to obtain a disc A'. This van Kampen diagram shows that if W is accepted by 
S, it is equal to 1 in G{S). The fact that if W = 1 in G{S) then W is accepted by S is 
proved by some small cancelation argument using the fact that the number of tapes is large. 
Thus, for example, if S has undecidable halting problem, then G{S) has undecidable word 
problem. Proving that the Dehn function of G polynomial in terms of the time function of 
S is much harder |210j and requires the snowman decomposition discussed in I4.6.AI 
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6.4. The embedding. Let now P = {X \ R) he a finitely generated group with recursively 
enumerable set of relators R. Take an S'-machine S that recognizes the language R. We 
can assume that S has sufficiently many tapes, and in tape 1 (between the state letters 
^1) ^2) we have the input word in the input configuration. Consider now another S'-machine 
S' which is a copy of S (with copies of the state letters) but it does not do anything in 
the input sector, so during the work of S' the input sector is always empty. We can also 
assume that the state letters of the input configurations in S,S' do not appear in the 
middle of (reduced) computations, the input state letters of S and S' coincide, all other 
state letters are different. Now let H be the amalgamated product of G{S) and G{S') with 
the amalgamated subgroup generated by the input state letters and all tape letters. Let 
W{u) be the input configuration of S corresponding to the input word u. It has subword 
u between state letters ki and k2. Let W{u)' be the word W{u) with this occurrence of u 
deleted. Then W{u) = 1 in G(5)(and thus u R) and W'{u) = 1 in G{S'). Hence u = 1 
in H. Therefore every u S i? is equal to 1 in and there exists a homomorphism from 
P to H. It can be proved that this homomorphism is injective. This proves the Higman 
embedding theorem. Proving that H has isoperimetric function that polynomially depends 
on the time function of S is of course more complicated (see jl87j ) . Again it involves cutting 
diagrams into pieces which are treated separately. 

7. Open problems 

Here we collect the open problems mentioned in the text. It may be more convenient for 
the reader to have all the problems in one place. 

Problem 7.1 (See l2.2.F]l . Is there (time or space) complexity bound for the word problem 
in a finitely presented residually finite group. More precisely, what is the highest class 
in the time complexity hierarchy [83] where the word problem in every finitely presented 
residually finite group belongs? An even more bold question: is the word problem of every 
finitely presented residually finite group in NP? 

Problem 7.2 (D. Cohen [52], also attributed to Stallings, see l2.4.C]l . Is it true that for 
every finitely presented group G there exists a constant a > 1 such that 

/(n) < af(") 

where /, g are, respectively, the Dehn function and the isodiametric function of G. 

Problem 7.3 (See l3.1.N]l . Find a common cause of the theorems of Cartan-Hadamard type 
from l3T.NI 

Problem 7.4 (See l3.2.Dp . Find a combinatorial proof of Theorem 13.361 Is the Dehn 
function of the central square of Gio equivalent to v? log n? 

Problem 7.5 (Rips, see I4.1.G"]) . Is F automatic? 

Problem 7.6 (See l4.1.Gl) . Is F semihyperbolic? 

Problem 7.7 (E. Rips, see 14.2. Cp . Is the conjugacy problem solvable in every finitely 
presented group with quadratic Dehn function? 

Problem 7.8 (See I4.2.T1) . Can Remark 14.91 be applied to prove that automatic groups have 
decidable conjugacy problem? 

Problem 7.9 (See 15. ip . Is it true that every finitely generated matrix group over a field 
(in particular, the field of rational numbers) embeds into a finitely presented group with 
quadratic Dehn function? 
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Problem 7.10 (See I5.1( ). Does every finitely generated group with word problem in NP 
embed into a group all of whose asymptotic cones are simply connected? Equivalently (by 
Theorem 15 ■2p : does every finitely presented group with polynomial isoperimetric function 
embed into a group all of whose asymptotic cones are simply connected? 

About the list of references. After each reference in the bibliography we put numbers 
of sections where we cite this reference. Each of the section numbers is the highest section 
number preceding the reference. Say, Section number 4.1 means that the reference appears 
in Section 4.1, before 4.1. A. 
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